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1.  Introduction 


The  main  objective  of  this  project  is  to  investigate  the  application  of  shape  memory  alloys  in 
the  robust  control  of  smart  structures.  This  objective  was  accomplished  by  developing: 

(i)  new  shape  memory  alloys  using  electrodeposition  process, 

(ii)  design  and  implementation  of  robust  controllers  for  smart  structures, 

(iii)  development  of  adaptive  control  techniques,  and 

(iv)  experimental  verification  of  dynamic  responses  of  smart  structures. 

This  final  report  summarizes  the  total  effort  on  this  project,  including  the  development  of 
new  shape  memory  alloys,  design,  simulation  studies,  and  implementation  of  robust  and  adaptive 
controller  on  smart  structures. 

Major  Accomplishments 

(1)  The  shape  memory  alloy  indium-thallium  (In-Tl)  is  produced  by  electrodeposition 
techniques.  The  effect  of  pulse  plating  parameters  on  the  compositions  of  the  In-Tl 
alloy  was  investigated.  The  relationship  between  composition  and  phase  structure  of 
the  alloys  deposited  was  investigated  with  regard  to  the  shape  memory  effect.  Not 
only  did  the  deposited  alloys  exhibit  the  shape  memory  effect,  but  the  composition- 
phase  relationship  was  shown  to  deviate  from  that  reported  in  alloys  in  an  equilibrium 
condition.  Since  the  stiffness  of  the  In-Tl  alloy  is  low  and  thallium  is  quite  toxic,  the 
indium-cadmium  (In-Cd),  copper-zinc  (Cu-Zn)  shape  memory  alloys  were  produced 
using  the  electrodeposition  process. 

(2)  Cu-Zn,  Au-Cd,  In-Tl  and  In-Cd  alloy  films  were  produced  using  electrolytic 
techniques  and  their  shape  memory  properties  were  evaluated.  Cyanide  solutions 
were  used  for  the  Cu-Zn  and  Au-Cd  deposits,  but  sulfate  solutions  were  preferred  for 
the  indium  alloys.  The  use  of  pulsed  current  was  beneficial  in  giving  dense  and 
uniform  deposits.  The  Cu-Zn  and  Au-Cd  alloys  did  exhibit  a  degree  of  brittle 
behavior  in  the  as-deposited  condition.  Simple  bending  tests  were  conducted  and 
shape  memory  effects  were  confirmed  on  the  Cu-Zn  and  indium  alloys. 
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(3)  To  demonstrate  the  capabilities  of  smart  structures,  a  simple  cantilever  beam  and  a  3- 
mass  system  (test  articles)  were  designed  and  fabricated  by  incorporating  active 
sensors  and  actuators.  The  electronic  circuits  for  signal  processing  and  drivers,  and 
data  acquisition  systems  have  been  designed,  fabricated  and  mounted  on  the  test 
articles.  The  shape  memory  alloy,  Nitinol,  was  used  for  actuator  purposes. 

(4)  The  mathematical  models  for  structural  systems  was  developed  by  using  finite 
element  models  and  structural  identification  techniques.  The  results  of  structural 
identification  were  utilized  in  validating  the  finite  element  models.  The  structural 
identification  method,  Eigensystem  Realization  Algorithm  (ERA),  was  adopted  for 
determining  mathematical  models  for  test  articles. 

(5)  To  minimize  the  effects  of  uncertainties  on  the  closed-loop  system  performance  of 
smart  structures,  robust  controllers  were  designed  and  implemented  on  test  articles. 
A  modified  robust  control  methodology  was  developed  to  accommodate  the  limited 
control  force  provided  by  shape  memory  actuators.  The  robustness  properties  of 
controllers  have  been  verified  experimentally. 

(6)  We  have  also  developed  techniques  for  identification  and  control  using  artificial 
neural  networks.  We  have  developed  a  multi-layer  neural  network  architecture  for 
implementing  structural  identification  method  (ERA)  and  experimentally  implemented 
on  test  articles.  We  have  introduced  adaptive  neuron  activation  function  and  an 
accelerated  adaptive  learning  rate  algorithm  for  reducing  the  training  time  of  neural 
networks.  The  robust  controller  was  implemented  on  a  simple  cantilever  beam  using 
a  neural  network. 

(7)  The  application  of  adaptive  control  algorithm  for  vibration  suppression  of  smart 
structures  was  investigated  in  this  project.  The  conventional  model  reference  adaptive 
control  algorithms  were  successfully  implemented  on  the  structural  test  articles. 

(8)  We  have  established  excellent  interdisciplinary  laboratory  facilities  for  conducting 
research  in  smart  structures  and  neural  networks. 

(9)  Based  on  the  results  accomplished  in  this  project,  we  have  submitted  a  proposal  to  the 
National  Science  Foundation  for  the  development  of  curriculum  in  the  smart 
structures  area.  The  NSF  has  approved  this  project  for  funding. 


2 


2.  Fabrication  of  Shape  Memory  Alloys  Using  Electrodeposition  Process 

As  the  use  of  shape  memory  alloys  (SMAs)  grows,  there  is  an  increasing  interest  in 
developing  new  synthesis  techniques  for  their  production.  Conventionally,  shape  memory  alloys 
are  produced  by  melting,  casting  and  shaping  followed  by  a  heat  treatment  cycle  to  give  the  desireu 
properties  [  1  ].  Powder  metallurgy  [2,3]  is  also  a  popular  processing  method  because  finer  grained 
metals  that  exhibit  reproducible  memory  characteristics  can  be  produced.  Sputtering  [4]  has  been 
investigated  to  prepare  shape  memory  films  and  although  the  technique  is  still  at  the  developmental 
stage,  the  initial  results  appear  quite  promising.  Rapid  solidification  techniques,  including  melt 
spinning  [5,6,7],  are  also  gaining  in  popularity  for  the  manufacture  of  shape  memory  ribbons  or 
wires. 


In  view  of  this  trend,  studies  were  undertaken  to  assess  the  use  of  electrolytic  methods  for 
making  alloys  which  exhibit  the  shape  memory  effect.  It  is  well  documented  that  the  phase- 
composition-temperature  relationship  for  alloys  made  electrolytically  can  vary  substantially  from 
similar  alloys  made  thermally  [8],  Relatively  stable,  but  non-equilibrium  structures  are  common;  a 
condition  probably  caused  by  the  internal  stress  level  inherent  to  thin  film  deposition  processes.  If 
a  similar  phenomenon  occurred  with  electrodeposited  SMA'S,  then  advantageous  changes  in 
properties  might  also  result  in  these  films  and  transformation  behavior  altered  by  varying  the 
operating  parameters  used  during  processing. 

Electrodeposition  offers  several  unique  features  in  that  alloys  can  be  deposited  as  thin 
films,  in  situ,  at  ambient  temperature.  An  additional  advantage  is  relatively  low  capital  and 
operating  cost,  and  high  volume  manufacture  with  relatively  good  reproducibility  and  control.  One 
major  limitation  is  that  a  number  of  important  metals,  such  as  aluminum  and  titanium,  can  not  be 
electrodeposited  from  aqueous  electrolytes. 

The  primary  objective  of  this  research  was  to  investigate  the  feasibility  of  using  electrolytic 
techniques  for  preparing  shape  memory  alloys.  The  Cu-Zn,  Au-Cd,  In-Tl  and  In-Cd  systems 
were  chosen  to  test  the  concept  since  they  can  be  electrodeposited  from  aqueous  solutions.  The 
study  focused  on  the  electrolytic  production  and  characterization  of  the  alloys,  with  emphasis  on 
the  composition  and  transformation  temperature  relationships. 
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2.1  Shape  Memory  Effect 


The  shape  memory  effect  occurs  if  the  alloy  ho^  the  ability  to  exist  in  the  martensitic  crystal 
form  at  one  temperature  but  will  transform  to  the  parent  phase  at  an  elevated  temperature.  This 
phenomenon  is  found  in  many  alloy  systems  and  a  listing  of  the  more  common  systems  is  given  in 
Table  2.1  [9], 


Table  2. 1  Shape  Memory  Alloys 


Alloy  Remark* 

(auatenlte/martenslte) 


Peeudoeleatlc(SIM) 

Cu-Zn 

Cu-Zn-X 

Cu-Zn-Sn 

Cu-AI-Ni 

Cu-AI-Mn 

Afl-Cd 

Au-Cd 

CuAuZn 

Fe,Be 

FejPt 

In-TI 

NI-TI 

Au-Cu-Zn 

Cu-Sn 

ahmpe  memory  effect 

In-Cd 

Ti-Nb 

304  stainless  steel 
Ni-AI 

Cu-Zn(-X) 

Afl-Cd 

Fe-NI 

Ni-Ti 

Cu-AI(-Ni) 

Au-Cd 
I  n 


bcc/orthorhombic(3R) 

bcc/orthorhombic(3R) 

bcc/orthorhombic 

bcc/orthorhombic(2H) 

bcc/orthorhombic(2H) 

bcc/orthorhombic(2H) 

bec/orthorhombic(2H) 

bcc/orthorhombic(3R) 

fcc/tetragonal 

fcc/tetraflonal 

fcc/tetragonal 

bcc/orthorhombic 

bcc/2H  ♦  1BR 

bcc/orthorhombic(2H) 

fcc/tetragonal 
fcc/bcc 
bcc/hcp 
bcc/CuAul  type 

bcc/orthorhombic 


The  stability  of  the  alloy  phases  of  various  compositions  in  shape  memory  systems  with 
changes  in  temperature  has  been  studied  for  a  number  of  alloys,  and  is  usually  indicated  by  the 
temperature  at  which  the  martensitic  transformation  occurs.  When  an  external  stress  is  introduced 
into  the  specimen,  the  situation  changes.  In  a  homogeneous  system,  the  relation  between  Ms  and 
the  composition  is  modified  to  account  for  the  effect  of  the  external  factor.  Thus  for  example,  the 
stability  of  the  parent  phase  (austenite)  and  the  products  (martensite)  does  not  only  depend  on  the 
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changes  in  the  temperature  and  the  composition  but  is  strongly  influenced  by  both  the  shear  and  the 
volume  stresses. 


(Strain) 


a:  Austenite  Finish 
b:  Martensite  Start 
e:  Martensite  Finish 


Fig.  2.1  Macroscopic  representation  of  the  shape  memory  effect 

In  Fig.  2.1,  this  effect  is  accounted  for  in  a  stress-strain-temperature  field  and  the  stability 
of  thermal  martensite  (a),  stress  induced  martensite  (SIM,oQ,  and  high  temperature  austenitic 
phase  (y).  The  relationship  between  stress  and  temperature  can  be  expressed  by  the  Clausius- 
Claperon  type  of  equation  applied  to  SIM  [10]. 

d(J(j/dT  =  AH/  eT0  Vm 

where,  Od  is  the  stress  for  SIM  formation. 

AH  is  enthalpy  of  the  SIM  formation. 

T0  is  the  equilibrium  temperature  for  austenite  and  SIM. 
e  is  the  strain  attained  at  complete  martensite  transformation. 
Vm  is  the  molar  volume. 

Fig.  2. 1  also  shows  that  the  stress  for  a  and  a'  formation  decreases  with  decreasing  temperature 
above  Ms  (martensite  start)  due  to  an  increase  in  instability  of  the  parent  y  phase,  becoming  zero  at 
Ms.  On  reaching  Ms,  further  cooling  causes  more  martensite  to  form  spontaneously  and  continues 
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until  Mf  (martensite  finish)  is  reached  where  the  specimen  completely  transforms  to  martensite.  If 
an  external  stress  (less  than  the  yield  stress)  is  applied,  the  strain  on  the  specimen  is  increased  to 
€d,  the  specimen  retains  the  same  dimensions  even  upon  removal  of  the  stress.  On  heating,  the 
reverse  martensite  transformation  starts  and  the  fraction  of  the  martensite  decreases  until  Af 
(austenite  finish)  is  reached  where  a  fully  austenitic  phase  is  attained  with  the  result  that  the 
specimen  recovers  its  initial  predeformed  shape. 

The  total  strain  may  or  may  not  be  completely  recovered.  If  the  specimen  is  strained  too 
far,  the  irreversible  deformation  introduced  will  not  be  recovered  at  any  temperature.  This 
irreversible  portion  of  the  strain  is  responsible  for  the  partial  shape  recovery  behavior  observed  in 
many  systems. 

2.2  Electrochemical  Synthesis  of  Shape  Memory  Alloy 


Electrodeposition  Process 


Metal  electrodeposition  involves  an  electrochemical  reaction  which  is  the  reductive 
discharge  of  ions  at  the  electrode  surface,  simply  illustrated  by  the  half  cell  reaction  M+n  +  ne-  -* 
M.  The  mechanistic  sequence  for  this  reaction  is  usually  described  by  identifying  the  regions  in 
which  the  ion  undergoes  the  changes  preceding  incorporation  into  the  crystallite  lattice. 


I 

I 
I 
I 
I 

¥1 
I 
I 
I 
I 

Diffusion  Bulk 

Layer  Electrolyte 


Fig.  2.2  Mechanistic  steps  in  metal  deposition 
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As  shown  in  Fig.  2.2,  a  hydrated  ion  arrives  at  the  diffusion  layer  from  the  bulk  under  the 
influence  of  the  imposed  electrical  field  as  well  as  by  convection.  The  ion  passes  through  a  stagnant 
region  called  the  diffusion  layer  5  and  the  driving  force  is  the  chemical  gradient.  Also,  the 
movement  of  electrolyte  is  quite  limited  in  adjacent  layers  due  to  viscosity  forces.  If  the  process  is 
mass  transfer  controlled  the  local  concentration  changes  of  the  reacting  species  due  to  the  relatively 
slow  diffusion  rate  dominate  in  this  region.  The  thickness  of  the  layer  can  be  estimated 
experimentally  and  relates  to  the  maximum  current  allowed  for  the  electrolysis  called  the  "limiting 
current".  The  thickness  of  the  layer  reflects  the  interaction  of  the  dynamic  forces  due  to  the 
electrolyte  movement  or  flow  in  the  bulk  phase  and  the  forces  of  viscosity  damping  the  movement 
near  the  electrode  surface.  In  practical  systems,  the  diffusion  layer  thickness  is  about  0.5  mm.  In 
systems  with  fast  flow  or  intensive  agitation,  the  value  could  be  an  order  of  magnitude  smaller. 
Electrolytic  deposition  of  metals  is  usually  performed  well  below  the  limiting  current  of  the 
deposition  reaction.  If  metals  are  electrodeposited  near  the  limiting  current  a  nodular,  impure, 
porous  and  even  powdery  deposit  may  be  produced. 


A  multitude  of  similar  occurrences  takes  place  over  the  entire  area  of  the  electrode,  thus 
causing  a  very  dynamic  situation  of  ion  movement.  Since  electrical  neutrality  must  be  maintained, 
the  anions  and  their  distribution  must  also  be  considered.  The  current  thinking  is  that  very  distinct 
zones  exist  at  the  electrode  interface,  which  are  a  function  of  the  high  field  strength,  hydration  effects 
and  ion  charge  distribution.  In  simple  terms  the  redistribution  of  electrical  charge  at  the  phase 
boundary  may  be  represented  as  two  parallel  sheets  of  charge  of  opposite  sign.  The  region  where 
this  is  observed  is  called  the  "double  layer",  and  a  schematic  representation  is  given  in  Fig.  2.3. 

O  H  2° 

IHP(anions)  /!**\ 

|  OHP(anions)  (Hla  Cation 

I 


O 


Anion 


Cation  electrf  statically 
I  held  in  double  layer 


fioTO 

Double  1  Diffusion  Bulk 

Layer  »  ^  Layer  ^  I  ^  Electrolyte 

Fig.  2.3  Representation  of  double  layer 
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The  nature  of  the  layers  next  to  the  electrode  can  have  a  profound  effect  on  the 
electrodeposition  of  metals.  The  thickness  of  the  layer  depends  on  the  level  of  agitation  or  stirring, 
other  ions  present  in  the  electrolyte,  temperature,  and  other  factors.  The  electric  field  strength  of 
the  double  layer  is  found  to  be  on  the  order  of  107  V/cm.  It  is  this  strong  field  acting  upon 
molecules  and  ions  during  the  electrochemical  reactions  which  so  greatly  affects  the  reaction  rates, 
and  the  probability  of  processes  occurring  at  electrode  interface.  In  sum,  it  is  these  non¬ 
equilibrium  characteristics  make  the  electrochemical  reactions  strong  candidates  to  produce  deposits 
with  unique  or  unusual  properties. 

Electrolytic  Synthesis  of  Alloy  Films 

Cu-Zn  alloys 

Brass  alloy  films  were  prepared  using  two  electrolytic  techniques,  pulse  electroplating  and 
spontaneous  cementation  from  cyanide  solutions. 

The  composition  of  the  electrolyte  for  pulse  plating  was  96  g/1  of  copper  cyanide,  10  g/1  of 
zinc  cyanide,  150  g/1  of  sodium  cyanide  and  44  g/1  of  sodium  hydroxide.  Bath  temperature  was 
held  at  45°  C  for  pulse  plating.  In  pulse  plating,  rotating  stainless  steel  disc  electrode  was  used  as  a 
cathode  and  brass  for  anode.  The  parameters  such  as  power  on  and  off  time  and  peak  current 
density  were  varied  as  was  the  metallic  ionic  concentration  in  the  solution  to  obtain  the  desired 
alloy  compositions.  Previous  studies  [11-14]  have  shown  that  when  metals  and  alloys  are 
deposited  using  pulsed  current,  the  mechanical  properties  were  often  better  than  those  of 
conventionally  plated  alloys.  Of  more  significance  in  this  study  was  the  phase  stability  of  the 
deposits  for  different  compositions  and  structures.  Fig.  2.4  simply  shows  the  wide  variation  of 
the  compositions  obtained  by  changing  the  operating  parameters. 
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Fig.  2.4  Relationship  between  copper  content  of  the  deposit  and  pulse  parameters 

Cemented  brass  films  were  produced  by  contacting  an  aluminum  sheet  with  a  solution 
containing  copper  and  zinc  ions.  The  brass  deposit  was  thought  to  be  the  result  of  the 
displacement  reaction  to  produce  a  solid  solution  alloy  [Cu-Zn]: 

Cu+  +  Zn2+  +  A1  =  [Cu-Zn]  +  Al3+ 

where  [Cu-Zn]  is  a  solid  solution. 

The  composition  of  the  deposits  and  phases  formed  were  varied  by  adjusting  the 
concentration  of  the  electrolyte  and/or  by  altering  the  temperature  of  the  electrolyte.  The  film 
thickness  produced  varied  from  about  20  to  40  pm  depending  upon  the  plating  time,  concentration 
and  temperature  of  the  electrolyte. 

Au-Cd  alloys 

The  gold-cadmium  film  used  in  the  low  temperature  study  was  obtained  using  standard  DC 
plating  technique.  The  formulation  of  the  cyanide  solution  (CdO:2.8  g/1,  KOH:5  g/1,  KCN:25  g/1 
and  KAu  (CN)2:0.4  g/1)  and  the  choice  of  plating  conditions  were  based  on  analyses  of  some 
deposits  made  during  screening  tests  in  which  the  electrochemical  parameters  were  varied.  The 
electrolyte  consists  Depositions  were  made  at  different  temperatures  using  a  stationary  stainless 
steel  cathode. 


9 


Indium  alloys 


A  sulfate  solution  containing  40.6  g/1  of  indium  sulfate,  2.2  g/1  of  thallium  sulfate,  and  28 
ml/1  of  sulfuric  acid  were  used  to  produce  indium-thallium  and  52  g/1  of  indium  sulfate,  2  g/1  of 
cadmium  sulfate  and  10  g/1  of  sodium  sulfate  was  used  for  indium-cadmium  alloy  films.  Platinum 
foil  was  used  as  an  anode  and  the  cathode  substrate  was  a  glassy  carbon  plate.  In-Tl  alloy  films 
were  electrodeposited  using  pulsed  current  generated  by  a  Dynatronix  Model  DPR  20- 1  -3  rectifier. 
To  obtain  different  compositions,  various  pulse  parameters  such  as  peak  current  density  (50-100 
mA/cm2),  on  and  off  time  (10-54  msec)  were  used.  The  bath  temperature  was  varied  from 
ambient  to  45°  C.  In-Cd  alloys  were  produced  by  EXT  plating  at  various  current  densities.  The 
average  thickness  of  the  deposits  was  about  50  |im. 

Phase  Transitions  and  Shape  Recovery 

Phase  transformations  were  investigated  as  a  function  of  temperature  using  a  Scintag  2000 
X-ray  diffractometer  with  low  temperature  chamber  or  high  temperature  furnace  attachment.  Phase 
structures  of  equilibrium  alloys  are  listed  in  Table  2.2  [15-18]. 

Table  2.2  Phase  Structures  of  SMAs 


Alloy  Phase 

_ Austenite  Martenalte 


Cu-Zn 

p  (CsCI) 

FCC, 

9R(monoclinic) 

3R(FCT) 

Au-Cd 

P(CsCI) 

B’(Orthorhombic), 

P  **  (Trigonal) 

In-Tl 

FCC 

FCT 

In-Cd 

FCC 

FCT 

Cu-Zn  alloys 

Since  most  of  the  brass  deposits  were  brittle,  characterization  studies  were  limited  to 
specimens  that  showed  sufficient  ductility  to  assure  adequate  bending.  One  example  of  X-ray 
results  for  electrolytic  brass  containing  45at%Zn  is  shown  in  Table  2.3. 
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Table  2.3  X-Ray  Diffraction  Data  of  Brass 


d 

25*C 

Spacing(A) 

-1 96°C  Calc. 

o.P 

Phasa 

2.27 

Martensite 

2.15 

2.14 

2.15 

a 

2.10 

2.08 

2.09 

P 

2.03 

Martensite 

1.86 

1.85 

1.86 

a 

1.48 

1.48 

1.48 

P 

1.32 

a 

1.21 

1.21 

1.21 

P 

1.12 

a 

1.07 

a 

1.05 

1.05 

1.05 

P 

From  the  equilibrium  diagram  for  the  composition  of  the  alloy  investigated,  a  (FCC)  and  (3  (BCC) 
phases  [19]  would  be  expected.  Comparing  the  results  made  at  liquid  nitrogen  temperature,  the 
room  temperature  phases  shows  additional  lines  at  d  =  2.27,  2.03  and  1 .76  A.  It  was  difficult  to 
specifically  identify  the  room  temperature  phases  because  of  the  small  number  of  peaks  present. 
However,  the  alloy  obviously  has  undergone  a  reversible  phase  transformation  during  cooling¬ 
heating  cycle. 

In  this  work,  the  electrolytic  brasses  were  found  to  show  structural  and  color  transitions 
when  the  specimens  were  cycled  between  the  test  temperatures.  At  room  temperature,  the  color  of 
the  films  were  copper-red,  but  turned  to  a  clear  yellow  color  in  the  liquid  nitrogen  as  the  specimen 
apparently  transformed  from  one  phase  to  another.  All  the  X-ray  patterns  made  for  the  electrolytic 
alloys  were  accounted  for  using  the  lines  corresponding  to  both  stress  induced  martensite  and 
thermally  formed  martensite  [20,21]  and  these  results  suggested  that  the  martensitic  phases  were 
able  to  be  stabilized  at  more  elevated  temperatures  by  the  deposition  process. 

The  structural  phase  transition  and  the  color  changes  noted  for  the  electrolytic  films  are 
comparable  to  the  martensite  3R  to  9R  martensite  or  a  3R  to  (3  phase  transition  sequence  in  Cu-Zn 
alloys  observed  by  Wayman  [19,20]  and  in  Cu-Al-Ni  [22].  The  change  in  color  and  the 
corresponding  phase  change  for  the  electrolytic  alloys  appeared  to  be  consistent  with  these 
literature  results,  but  the  color  changes  occurred  in  a  directly  opposite  direction.  The  (3  phase 
(yellow  color)  that  was  observed  in  thermally  prepared  alloys  at  room  temperature  was  present 
when  the  electrodeposited  specimens  were  immersed  in  liquid  nitrogen.  When  warmed  to  room 
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Table  2.4  X-ray  diffraction  data  for  Au-Cd 


d  Spacing  (A) 

25*C  *1 96*C 

Phase 

3.32 

P 

3.30 

r 

2.66 

r 

2.60 

p‘ 

2.40 

P' 

2.36 

p" 

2.354 

p" 

2.348 

p" 

2.34 

p* 

2.23 

P’ 

1.94 

p* 

1.66 

p 

1.65 

p" 

1.45 

p* 

1.40 

p“ 

1.35 

P" 

1.23 

p* 

1.18 

p 

1.16 

p- 

The  Au-Cd  alloy  films  were  generally  found  to  be  too  brittle  to  allow  detailed  studies 
involving  mechanical  working.  However,  some  simple  bending  and  heating  tests  were  done  on 
some  of  the  more  ductile  samples  to  study  the  shape  memory  effect.  The  film  consisting  of 
50at%Cd  occasionally  showed  a  slight  shape  recovery  when  deformed  in  liquid  nitrogen  and 
subsequently  heated  to  room  temperature. 


Indium  alloys 


A  surface  morphology  of  an  electrodeposited  In-Tl  alloy  obtained  by  scanning  electron 
microscope  is  shown  in  Fig.  2.5.  The  surface  of  the  deposit  is  dense  and  reasonably  uniform 
showing  various  crystallite  sizes  from  few  pm  to  20pm. 
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Transformation  Temperature  (°C) 


Fig.  2.5  Scanning  electron  micrograph  of  In-Tl  alloy  deposit 


Indium  alloys  with  thallium  content  of  18-21at%  and  cadmium  content  of  4-5at%  are 
known  to  undergo  FCC  (face  centered  cubic)-FCT  (face  centered  tetragonal)  phase  transformation 
above  room  temperature  and  show  shape  memory  effect.  Phase  structures  of  electrodeposited 
indium  alloys  were  investigated  using  high  temperature  X-ray  diffraction.  The  relationship 
between  phase  transformation  and  composition  of  electrodeposited  In-Tl  alloys,  given  in  Fig.  2.6 
[23,24],  shows  a  large  deviation  from  equilibrium  alloys  of  similar  composition.  In  fact,  the  phase 
transformation  temperatures  of  the  electrodeposited  alloys  are  higher  than  those  of  thermally  made 
equilibrium  ones.  However,  the  result  from  In-Cd  alloy,  given  in  Fig.  2.7,  show  the  opposite 
trend. 


Intensity 


Fig.  2.6  Relationship  between  composition  and  phase 
transition  temperature  for  In-Tl  alloys  [23,24] 


Fig.  2.7  X-ray  diffraction  profiles  for  ln-5.1at% 
Cd  alloy  film 
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The  as-deposited  alloy  has  austenite  (FCC)  and  martensite  (FCT)  mixture  at  room  temperature.  As 
the  alloy  is  heated,  transformation  occurs  and  completed  at  55°  C.  After  the  cooling  to  room 
temperature  again,  it  has  almost  austenite.  On  further  cooling,  martensite  transformation  takes 
place  and  completed  somewhere  below  zero  degree  in  Celsius.  When  the  deposit  is  warmed  up  to 
room  temperature  for  the  3rd  time,  it  has  now  austenite  and  martensite  mixture.  In  fact,  the 
transformation  temperature  of  electrodeposited  In-Cd  alloy  is  sub-zero. 


It  was  reported  by  Nittono  [25]  that  In-Tl  and  In-Cd  alloys  had  a  perfect  shape  memory 
effect.  The  schematic  shape  recovery  curve  of  an  indium  alloy  is  shown  in  Fig.  2.8.  The  shape 
recovery  can  be  divided  into  three  stages.  In  early  stage(l),  there  is  a  linear  recovery,  but  small  in 
quantity,  region  associated  with  reorientation  with  increasing  temperature.  A  remarkable  shape 
change  occurs  in  the  second  stage(II)  due  to  the  phase  transformation.  In  the  final  stage(III),  the 
recovery  is  not  appreciated  showing  that  the  shape  recovery  has  finished.  Some  qualitative  tests 
were  made  to  see  if  the  electrodeposited  alloys  exhibits  the  shape  memory  effect.  An 
electrodeposited  alloy  was  cut  into  a  narrow  strip  and  bent  into  a  ring  shape  at  room  temperature. 
It  was  hard  to  distinguish  the  recovery  regions  as  described  above  for  equilibrium  alloys. 
However,  when  the  sheet  was  heated,  movement  toward  its  original  shape  was  observed  showing 
at  least  partial  recovery  and  the  shape  memory  effect. 


2.2.1  Summary 


The  results  show  that  Cu-Zn  films  prepared  electroiytically  and  in  the  compositional  range 
(45-52at%Zn)  appear  to  be  martensitic  at  room  temperature.  The  transformation  from  martensite  to 
austenite  takes  place  when  the  crystal  phase  is  distorted  by  cooling  and  this  transformation  is  found 
to  be  thermally  reversible.  Au-Cd  alloy  showed  what  appeared  to  be  reversible  X-ray  patterns  but 
the  shape  response  was  apparently  irreversible.  The  transformation  temperatures  of 
electrodeposited  indium  alloys  show  a  large  deviation  from  those  of  equilibrium  ones,  showing 
higher  transformation  temperature  for  In-Tl  and  lower  for  In-Cd.  Alloy  film  prepared 
electroiytically  also  exhibit  some  useful  shape  recovery. 

Possible  reasons  for  the  electrochemically  synthesized  shape  memory  alloys  having 
unusual  properties  are  due  to  small  grain  size  of  the  deposit  or  internal  stress  developed  during  the 
process  by  adsorption  of  foreign  species,  building  up  of  defects  during  electrocrystallization,  or 
inclusion  of  gaseous  phases  evolved  during  electrolysis.  Despite  the  apparent  limitation  on  a  clear 
picture  of  the  phase  transformation  at  this  stage,  once  the  roles  of  the  factors  affecting  the 
properties  of  the  film  are  elucidated,  the  electrolysis  can  be  a  cheap  and  relatively  easy  way  to 
produce  the  shape  memory  alloy  films  with  intentionally  modified  shape  memory  properties. 
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2.3  Investigation  of  Shape  Memory  Properties  of  Electrodeposited  Indium- 
Thallium  Alloys 

Materials  which  exhibit  the  shape  memory  effect  are  finding  increased  usage  as  both 
sensors  and  actuators  in  smart  or  adaptive  structures  [26,28,30].  The  essential  requirement  for 
metal  alloys  to  be  classified  as  shape  memory  alloys  is  a  martensitic  transformation  upon  cooling, 
with  the  appropriate  phase  change.  Advantage  is  then  taken  of  the  dimensional  change  which 
accompanies  the  alteration  in  structure  with  temperature  to  perform  some  corrective  action  in  a 
structure.  Recently,  studies  in  our  laboratories  have  demonstrated  that  it  is  possible  to  produce 
alloys  which  exhibit  the  shape  memory  effect  using  electrodeposition  techniques.  This  unique 
processing  method  offers  a  number  of  potentially  attractive  features,  which  in  time  might  be 
incorporated  into  advanced  responsive  control  systems.  The  alloys  can  be  deposited  in  place,  at 
ambient  temperature,  in  thin  films  or  layers,  and  in  a  variety  of  structures  and  compositions. 

In  this  research,  the  objective  was  to  produce  alloys  electroiytically  which  had 
compositions  in  a  range  where  the  shape  memory  effect  was  known  to  occur.  The  indium-thallium 
system  was  chosen  to  show  the  feasibility  of  the  concept  and  research  was  focused  on  the  product 
of  these  alloys  in  a  composition  range  of  15  to  38  at%  Tl.  It  was  determined  that  homogeneous, 
dense  alloy  films  could  be  electrodeposited  using  pulsed  current  from  a  sulfate  electrolyte  and  the 
effect  of  pulse  parameters  on  the  composition  and  morphology  was  investigated.  The  phases 
existing  in  the  electrodeposited  alloys  above  room  temperature  were  determined  by  high 
temperature  X-ray  diffraction  techniques. 

2.3.1  The  Shape  Memory  Effect 

The  mechanism  responsible  for  the  shape  memory  effect  involves  the  presence  of  a 
martensitic  phase  transformation  during  temperature  change.  The  temperature  at  which  the 
martensitic  transformation  starts  on  cooling  is  called  the  Ms  temperature  and  the  temperature  at 
which  the  martensitic  transformation  stops  is  called  Mf.  For  the  reverse  transformation,  these 
temperatures  for  conversion  back  to  the  parent  phase  are  called  As  and  Af,  respectively.  The  shape 
memory  effect  can  be  explained  by  the  stress-strain-temperature  relationship  of  the  alloy,  shown  in 
Figure  2.9.  The  alloy  is  cooled  below  its  Mf  temperature  (a  to  b).  Then,  the  alloy  is  deformed  in 
the  martensitic  state  (b  to  c)  and  unloaded  (c  to  d).  There  is  a  permanent  strain  present  but  it  is  not 
restored  when  the  load  is  removed.  However,  when  the  alloy  is  heated,  martensite  is  converted  to 
the  parent  austenite  phase  and  the  material  is  restored  in  its  original  shape.  In  the  case  of  the  In-Tl 
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system,  the  martensite  phase  is  a  FCT  (fact  centered  tetragonal)  and  the  parent  phase  is  FCC  (face 
centered  cubic). 


Fig.  2.9  Illustration  of  the  shape  memory  effect 


2.3.2  Pulse  Plating 

Indium-thallium  alloys  were  electrodeposited  from  the  electrolyte  containing  40.6  g/1  of 
indium  sulfate,  2.2  g/1  of  thallium  sulfate,  and  28  ml/1  of  sulfuric  acid  with  the  peak  current 
densities  from  6.7  mA/cm2  to  67  mA/cm2.  Platinum  foil  was  used  as  an  anode  and  the  cathode 
was  a  glassy  carbon  plate.  The  effect  of  peak  current  density  on  the  composition  of  the  alloy 
deposit  is  shown  in  Figure  2.10.  The  alloy  deposits  obtained  at  low  current  densities  in  the  range 
of  6.7  to  10  mA/cm2  show  high  thallium  content  of  30-40  at%  and  an  increase  in  current  density 
decreased  the  thallium  content  in  the  deposit. 


Fig.  2.10  Effect  of  peak  current  density  on  thallium  content 
of  the  deposits  obtained  at  25°  C  without  agitation 
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As  the  current  density  increases,  the  effect  on  composition  seems  to  diminish.  In  the 
current  density  region  used  in  this  study,  thallium  is  deposited  at  its  limiting  current  value. 
Therefore,  any  increase  in  current  density  actually  leads  to  an  increase  in  the  rate  of  indium 
deposition.  At  the  high  current  density  region  from  50  to  70  mA/cm2,  the  thallium  content 
approaches  the  limiting  value,  5.3  at%,  which  was  the  concentration  of  thallium  in  the  electrolyte. 
The  composition  of  the  alloy  does  not  seem  to  be  dependent  on  the  other  pulse  parameters  such  as 
duty  cycle  and  off  time. 

2.3.3  Screening  design  Test 

A  Plackett-Burman  statistical  design  test  {29]  was  performed  to  investigate  the  effect  of 
selected  controllable  process  variables  on  the  composition  of  the  deposit.  This  design  is  based  on  a 
simplified  first  order  empirical  model,  but  will  also  yield  information  indicating  overall  curvature, 
or  deviation  from  the  linear  model  and  gives  an  indication  of  the  presence  of  interaction  effects 
among  variables.  An  eight-run  screening  design  was  chosen,  and  the  experimental  procedure 
employed  follows  basically  that  described  in  the  literature  [32]. 

Pulse  plating  parameters  used  in  this  test  were  on  time,  off  time,  peak  current  density,  and 
temperature.  The  factor  levels  and  the  calculation  of  factor  effects  are  shown  in  Table  2.5.  The 
screening  test,  as  given  in  Table  2.6,  shows  that  the  peak  current  density  is  the  only  significant 
factor  in  changing  the  composition  of  the  alloys,  with  the  results  indicating  that  an  increase  in  peak 
current  density  causes  a  decrease  in  thallium  content. 


Table  2.5  Statistical  design  results  for  pulse  plating  of 
Indium-Thallium  alloy. 


Table  2.6  Summary  of  factor  effect  intervals 
at  95%  confidence  limit 


Oder 

Response 

<n  *%> 

Xt 

X2 

Factor  vanatte 

X3  X4  XS 

X9 

X7 

2 4  27 

20 

20 

20 

25 

* 

. 

2 

2t  93 

40 

40 

40 

45 

• 

3 

35  25 

20 

40 

20 

45 

• 

4 

♦  9  29 

40 

40 

20 

25 

• 

S 

19  41 

20 

20 

40 

45 

• 

t 

15  25 

40 

20 

40 

25 

• 

7 

14  44 

20 

40 

40 

25 

• 

• 

19  20 

40 

40 

40 

45 

• 

• 

• 

192  74 

72  37  95  IS  92  30  M  49  79  54  90  97 

74  41 

I 

90  37 

77  4* 

100  44  73  2SM  20 

7|  77  M  33 

Oo»s«e 

1900 

792 

39  >419  24 

999 

19  20 

13  92 

Fedor  Effect 

20  34 

-4  SO 

l  9t 

9  54 

409 

.?  42 

4  10 

349 

•  ttcaitt  regn  tM  o *  (actor  vanana  imii 


Mcatf  Krn  '■•turn  Of  MOO*  van onto  levels 
Xi  on  emeims)  X2  ot  »me<me|  X3  peM  currem  den«ty(mA*cm2} 
Xf  Wf*ot*w*t  Cl mq  XS  X*  X7  dummy 
Center  pt>«V  response  If  1  7  U7l  M  lf|< 

Average  ISM  Standard  Don— o*<«l  ON 

Faeaw  e*ea  of  x*  .  jjjSSBSSSX  *  Xl)  tfcgflg**  *  Xi| 
rwmoer  ot  laaonoi  <vv? 


Facie*  Effect  'niervai 


X.fon  fimef 

-*  SO  i  5  23* 

X*(Oft  lime) 

i  9>t  523 

X)(pea*  e  dl 

9  54  t  5  23 

X«(Temp  ) 

4  09  1  5  73 

Xi(dummy) 

•2  42  t  5  23 

X«(dummy  1 

4  90  t  523 

X  it  dummy! 

3  49  t  5  23 

•  Thp  nu*rrt>*f  *5  23'  denotes  (actor  effects  which  *»•  siat<stica<>v 
significant  at  95%  eonfdenca  m«ei 

Conhoenco  interval  .  factor  effect  ;  is/Vn/4 
at  95%  confidence  limit  1  -  4  303  where  t  -s  me  student  $  > 
variable  N  *s  the  number  of  design  pemt  *uns/S)  s  •*  the  standard 
rteviaiionto  96) 


18 


2.3.4  Phase  Transformation  and  Shape  Recovery 


In-Tl  alloys  in  a  certain  composition  range  have  a  FCC  structure  as  a  parent  phase  and 
undergo  a  martensitic  transformation  to  a  FCT  structure  on  cooling  [31].  Figure  2.1 1  shows  the 
X-ray  diffraction  profiles  of  21  at%  T1  alloy  deposit.  The  figure  indicates  that  the  electrodeposited 
alloy  undergoes  the  phase  transformation  with  change  in  temperature.  According  to  the 
experimental  results  on  thermally  prepared  equilibrium  alloys  [27],  the  transformation  temperature 
(As)  of  a  21  at%  T1  is  around  60°  C  and  the  alloy  transforms  completely  by  63-65°  C.  Since  the 
electrodeposited  alloy  shows  a  mixture  of  FCC  and  FCT  phases  in  140°  C,  it  seems  the 
transformation  was  not  complete,  even  when  the  temperature  was  well  in  excess  of  that  expected 
for  thermally  prepared  alloys.  Figure  2.12  shows  the  grain  structures  of  both  thermally  prepared 
alloys  and  electrodeposited  alloys  of  a  similar  composition.  The  electrodeposited  alloys  has  small 
grains,  a  few  pms  in  diameter  and  somewhat  varied  in  grain  size  while  the  thermally  prepared 
sample  has  much  larger  grains  approximately  hundreds  of  pms  in  size.  Some  qualitative  tests  were 
made  to  see  if  the  electrodeposited  alloys  exhibit  the  shape  memory  effect.  An  electrodeposited 
alloy  was  cut  into  a  narrow  strip  and  bent  at  room  temperature.  When  the  sheet  was  heated, 
movement  toward  its  original  flat  shape  was  observed  showing  at  least  a  partial  recovery  and  the 
shape  memory  effect.  The  deformation-recovery  test  was  repeated  several  times  and  recovery  was 
reproduced  in  every  instance. 


Fig.  2.11  X-ray  diffraction  profiles  of  In-2 1  at%  T1 
alloy  deposit 


Fig.  2.12  The  grain  structures  of  Indium- 
Thallium  alloys 

(a)  Thermal.  22  at*7r  T1 

(b)  Electrodeposited.  2 1  at%Tl 
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2.3.5  Summary 


Indium-thallium  alloys  can  be  successfully  deposited  from  a  sulfate  electrolyte  using  pulsed 
current.  An  increase  in  peak  current  density  causes  a  decrease  in  thallium  content  in  the  deposit. 
The  results  from  the  screening  test  show  that  the  peak  current  density  is  the  only  significant  factor 
which  affects  the  composition  of  electrodeposited  alloys.  The  electrodeposited  alloy  shows  a 
martensite  to  parent  phase  transformation,  FCT  to  FC  upon  heating.  The  temperature  interval  for 
the  transformation  of  the  electrodeposited  alloys  is  much  wider  than  those  of  the  thermally  prepared 
alloys  but  the  electrodeposited  alloy  does  not  show  the  shape  recovery.  The  ability  to  produce 
these  new  types  of  shape  memory  materials  by  electrolysis  appears  to  offer  some  exciting 
possibilities  for  incorporation  into  adaptive  structures.  However,  considerably  more  research  is 
required  to  more  clearly  define  and  characterize  these  materials  in  order  to  optimize  their  potential 
applications  in  actual  control  systems.  Future  plans  call  for  optimizing  the  properties  of  the  In-Tl 
electrodeposited  alloys  and  investigate  their  use  as  possible  sensors  in  the  active  control  of  a  simple 
cantilever  beam. 
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2.4  Characterization  of  Phase  Transformation  Behavior  in  Electrolyticaily 
Produced  Indium-Thallium  Shape  Memory  Alloy  Films 

Alloys  in  the  composition  range  of  15-30  at%  thallium  are  known  to  have  a  face  centered 
cubic(FCC)  structure  as  a  parent  phase  which  undergo  a  martensitic  transformation  to  a  face  centered 
tetragonal(FCT)  structure  on  cooling,  accompanied  by  a  very  small  heat  of  transformation  and 
volume  difference  [33].  Bowles,  Barrett  and  Guttman  [34,35]  proposed  the  transformation  occurs 
by  a  double  shear  process  which  is  first  on  the  (101)  in  the  [101]  direction  followed  by  a  second 
shear  on  (Oil )[0 11].  Since  they  reported  the  FCC/FCT  transformation  in  In-Tl  alloys,  the 
transformation  behavior  of  the  alloy  has  been  extensively  studied  [36-40].  The  superelastic 
phenomenon  associated  with  the  stress-induced  transformation  and  rubberlike  behavior  due  to 
reorientation  of  martensite  plates  of  In-Tl  alloys  have  been  reported  by  Burkart  [41]  and  Miura  [42]. 
Nagasawa  [43]  and  Nittono  [44]  have  shown  experimentally  that  the  alloy  exhibits  a  shape  memory 
effect(SME)  by  which  an  alloy,  apparently  plastically  deformed,  reverts  to  its  original  shape  upon 
heating  to  high  temperature. 

Since  Olander  [45]  reported  the  shape  memory-related  phenomenon  in  Au-Cd  alloys,  many 
alloy  systems  [46-50]  such  as  Ni-Ti  and  Cu-Al-Ni,  showing  the  shape  memory  effect(SME),  were 
developed  and  characterized  [51-54].  Recently,  attention  has  been  toward  developing  commercial 
applications  for  shape  memory  alloys(SMA)  such  as  tube  fitting,  electrical  connectors,  sensors  and 
actuators  in  macro  and  micro  scale  systems  and  for  smart  structures.  [55-60].  The  result  is  that 
many  new  fabrication  methods  such  as  sputtering  and  rapid  solidification  are  being  attempted  in 
addition  to  the  conventional  melting  processes  in  an  attempts  to  enhance  the  properties  and 
Performance  of  the  SMA's  [61-64]. 

The  electrodeposition  technique  offers  a  number  of  attractive  features,  in  that  alloys  can  be 
directly  deposited,  in  situ,  at  ambient  temperature  and  in  the  form  of  thin  films.  The  phase  structures 
and  composition  of  alloys  produced  by  electrodeposition  are  often  substantially  different  from  the 
alloys  produced  by  a  conventional  melting  processes.  The  non-equilibrium  condition  is  often  quite 
stable  and  provides  some  interesting  property  differences  from  those  normally  encountered  with 
alloys  of  similar  composition  in  an  equilibrium  state.  In  other  words,  the  characteristics  and 
behavior  of  electrodeposited  shape  memory  alloys  might  be  expected  to  deviate  substantially  from 
those  of  thermally  prepared  equilibrium  alloys.  Previous  research  showed  that  homogeneous,  dense 
In-n  alloy  films  could  be  electrodeposited  using  pulsed  current  from  a  sulfate  electrolyte  and  the 
alloys  showed  the  shape  memory  effects  [65],  The  study  also  focused  on  the  relationship  between 
phase  structures  and  composition  of  electrodeposited  alloys  and  the  transformation  behavior  with 
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temperature.  The  phases  of  the  electrodeposited  alloys  above  room  temperature  were  determined  by 
high  temperature  X-ray  diffraction  techniques,  and  compared  with  those  of  thermally  prepared  alloys 
of  similar  compositions. 

2.4.1  Experimental  Studies 

Production  ofln-Tl  Alloys 

Thermal  methods  were  used  to  make  conventional  alloys  to  compare  with  those  made 
electrolytically.  Preweighed  granules  of  indium(99.999%)  and  thallium(99.99%)  were  melted 
together  under  an  argon  atmosphere  using  a  high  frequency  induction  furnace.  To  ensure 
uniformity,  the  alloy  was  stirred  vigorously  prior  to  rapid  quenching  in  a  graphite  mold.  Resultant 
ingots,  for  which  no  appreciable  change  in  weight  was  detected  after  alloying,  were  cold  rolled  and 
homogenized  at  120°  C  for  48  hours  in  a  tube  furnace  under  an  argon  atmosphere.  Two  alloys.  In  - 
22  and  27  at%  T1  were  prepared  in  this  manner  for  further  study. 

The  electrolyte  was  prepared  by  dissolving  40.59  g/1  of  indium  sulfate  and  2.22  g/1  of 
thallium  sulfate  in  a  sulfuric  acid  solution  containing  28  ml/I  of  sulfuric  acid.  Platinum  foil  was  used 
as  an  anode  and  the  cathode  substrate  was  a  glassy  carbon  plate.  For  the  polarization  measurements, 
a  saturated  calomel  electrode  was  used  as  a  reference. 

In-Tl  alloy  films  were  electrodeposited  using  pulsed  current  generated  by  a  Dynatronix 
Model  DPR  20-1-3  rectifier.  To  obtain  different  compositions,  various  pulse  parameters  were  used 
such  as  peak  current  density,  pulse  frequency-  and  duty  cycle  were  changed.  The  bath  temperature 
was  varied  from  ambient  to  45°  C  and  no  agitation  was  applied.  The  average  thickness  of  the 
deposits  was  about  50  }im.  Alloy  deposits  with  thallium  contents  of  15,  20,  21,  24,  28,  and  37.6 
at%  were  used  for  this  research.  Some  of  the  electrodeposits  were  also  annealed  at  140°  C  for  72 
hours  in  a  tube  furnace  under  an  argon  atmosphere  to  determine  if  any  phase  changes  occurred  on 
heating. 

Characterization  of  In-Tl  Alloys 

The  surface  morphology  of  the  electrodeposits  was  studied  using  scanning  electron 
microscopy.  Chemical  compositions  of  all  the  samples  used  in  this  study  were  obtained  from 
energy  dispersive  X-ray  spectrometer  measurements.  Auger  electron  spectroscopy  was  employed  to 
check  the  uniformity  and  composition  of  the  electrodeposits. 


22 


X-ray  diffraction  profiles  of  both  thermally  prepared  and  eiectrodeposited  alloys  were 
generated  at  room  temperature  and  140°  C.  The  X-ray  diffractometer  used  in  this  study  was  a 
Scintag  XDS-2000  equipped  with  a  high  temperature  furnace.  The  target  metal  was  copper  and  the 
data  were  obtained  in  the  20  range  of  30°  to  70°.  To  find  the  phase  transition  temperature,  the 
furnace  temperature  was  raised  in  5-10°  C  increments  and  held  for  about  5  minutes  at  each 
temperature  prior  to  generating  X-ray  data.  To  prepare  the  alloys  for  metallography,  the  deposits 
were  electropolished  at  room  temperature  using  a  solution  containing  75  ml  carbitol,  5ml 
concentrated  nitric  acid  and  2ml  concentrated  hydrochloric  acid  and  a  voltage  in  the  range  of  30  to 
50V  [34].  The  surface  then  was  studied  using  a  Jeol  JCM-35CF  scanning  electron  microscope. 

2.4.2  Results  and  Discussion 

Polarization  behavior  In  a  simple  acidic  electrolyte,  thallium  and  indium  are  reported  to  have 
about  the  same  standard  electrode  potentials  [66]. 

In+3  +  3e‘  -*  In  E°  =  -  0.340  V 

TI+  +  e‘  -*  TI  E°  =  -  0.336  V 

Due  to  their  similar  electrode  potentials,  codeposition  of  In-Tl  from  electrolytes  containing  simple 
ions  should  be  possible.  Fig.  2.13  and  Fig.  2.14  show  the  forward  scans  of  cyclic  voltammograms 
for  sulfate  electrolyte  systems  obtained  at  room  temperature.  The  curves  indicate  both  metals  deposit 
with  a  significant  cathodic  overpotential  on  a  carbon  electrode,  and  the  degree  of  polarization  is 
higher  with  indium.  As  a  result,  thallium  metal(Fig.  2.13  (c))  deposits  preferentially  and  reaches  a 
limiting  plateau  of  0.  8  mA/cm-  before  indium  deposition  occurs  for  the  concentrations  used.  When 
the  thallium  current  is  in  the  limiting  region,  there  was  a  small  amount  of  hydrogen  gas  evolved  on 
the  cathode.  After  the  potential  exceeds  -  1.30  V,  hydrogen  starts  to  evolve  extensively  and  the 
thallium  current  starts  to  increase.  In  the  case  of  indium  alone,  as  shown  in  Fig.  2.13  (b),  indium 
metal  begins  to  be  deposited  when  the  curve  for  thallium  is  already  in  the  limiting  region.  As  soon 
as  the  indium  deposits,  hydrogen  also  evolves  on  the  cathode.  The  polarization  curve  for  indium 
alone  actually  represents  the  curve  for  indium  and  hydrogen. 
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Fig.  2.13  Fig.  2.14 

The  polarization  curve  for  an  alloy  is  not  necessarily  the  summation  of  the  individual  curves 
for  each  metal  [67],  In  this  case,  however,  it  seems  that  the  alloy  curve  does  approximately 
represent  the  sum  of  the  polarization  curves  for  the  two  metals.  The  polarization  curves  using  an 
agitated  electrolyte  are  shown  in  Fig.  2.14.  It  is  clear  that  the  deposition  of  thallium  is  under 
diffusion  control  at  potentials  above  -  0.8  V  vs  SCE  since  the  limiting  current  density  plateau  shows 
an  increases  when  the  electrolyte  is  agitated.  The  relationship  between  the  potential  and  current 
density  of  the  indium-thallium  system  shows  that  the  process  is  largely  controlled  by  diffusion  of 
thallium  making  it  more  sensitive  to  electrolyte  agitation  than  indium.  When  the  system  was 
mechanically  agitated  in  some  preliminary  experiments,  the  thallium  content  in  the  deposit  was 
dramatically  increased.  In  fact,  the  deposit  became  thallium  rich  and  was  very  dendritic,  and 
subsequent  alloy  deposits  were  made  without  agitation. 

When  standard  DC  was  used,  it  was  also  difficult  to  get  a  uniform  and  smooth  alloy  deposit. 
The  deposits  were  dendritic  probably  because  the  thallium  was  being  deposited  at  its  limiting 
current.  In  this  study,  therefore,  pulsed  current  was  chosen  to  electrodeposit  the  In-Tl  alloys.  In 
order  to  find  the  optimum  plating  condition  to  produce  a  fine  deposit,  preliminary  experiments  and 
screening  design  tests  were  performed  as  described  previously  [65].  As  a  result,  uniform,  dense 
alloy  deposits  with  the  desired  thallium  content  could  be  produced  using  a  peak  current  density  of  10 
to  40  mA/cm2  with  10  msec  as  the  on  time  and  duty  cycle  of  about  30%  at  room  temperature. 
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Structure  of  as-deposited  alloys 


In-Tl  alloys  in  a  certain  composition  range  have  a  face  centered  cubic(FCC)  structure  as  a 
parent  phase  and  undergo  a  martensitic  transformation  to  a  face  centered  tetragonal  structure(FCT) 
on  cooling  [36-37],  In  this  study,  phase  structures  of  electrodeposited  alloys  having  a  composition 
of  15,  20,  21,  24,  28,  and  38  at%Tl  were  investigated  using  high  temperature  X-ray  diffraction. 
The  results  are  shown  in  Table  1.  According  to  the  equilibrium  phase  diagram  [68]  and  the 
experimental  results  on  thermal  alloys  [39-40],  a  28%T1  alloy  has  only  the  FCC  structure  from 
ambient  temperature  to  the  melting  point.  However,  the  electrodeposited  28%T1  alloy  shows  a 
mixture  of  FCC  and  FCT  phases  at  room  temperature.  It  seems  the  martensite-austenite 
transformation  was  not  complete  for  the  21,  24,  and  28%T1  alloys  even  when  the  temperature  was 
taken  as  high  as  1400C,  since  some  FCT  phase  was  detected  by  X-rays.  The  20%T1  does  not 
undergo  a  phase  transformation  at  all,  remaining  FCT  throughout. 

Alloys  having  thallium  contents  of  22  and  27  at%  were  thermally  prepared  to  compare  the 
results  of  X-ray  data  with  electrodeposited  alloys  of  similar  compositions.  The  results  are  also 
shown  in  Table  2.7.  The  thermal  alloy  of  22  at%Tl  shows  FCT  at  room  temperature  and  completely 
transforms  to  the  FCC  phase  by  55°  C.  The  27%T1  alloy  was  also  FCC  at  all  temperatures  studied. 
This  behavior  is  in  good  agreement  with  the  equilibrium  phase  diagram  and  other  reports  in  the 
literature. 


Table  2.7.  Relationship  Between  Composition  and  Phase  Structure  of  Deposited  Alloys 


TI  Content  Phase  Structure 


(at%) 

25°  CfEquil.) 

140°  CfEquil.) 

15.0 

all  FCKall  FCT) 

all  FCT(alI  FCT) 

20.0 

all  FCT(aII  FCT) 

all  FCTfall  FCC) 

21.0 

all  FCTfall  FCT) 

63%  FCC(all  FCC) 

22.0(T) 

(all  FCT) 

(all  FCC  at  55°  C) 

24.0 

all  FCT(alI  FCC) 

60%  FCC(all  FCC) 

27.0(T) 

(all  FCC) 

(all  FCC) 

28.0 

24%  FCC(all  FCC) 

94%  FCC(all  FCC) 

38.0 

all  FCC(all  FCC) 

all  FCCfall  FCC) 

(T):  stands  for  thermally  prepared. 

Equilibrium  structures  are  indicated  in  parentheses. 
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The  relationship  between  phase  and  composition  of  electrodeposited  In-Tl  alloys  shows  a 
large  deviation  from  equilibrium  alloys  of  similar  composition.  The  typical  example  of  X-ray 
diffraction  profiles  of  electrodeposited  alloy  obtained  at  ambient  temperature  and  140°  C  are  shown 
in  Fig.  2. 15.  The  sample  was  cooled  to  ambient  temperature  after  the  x-ray  work  at  140°  C  and  the 
procedure  was  repeated.  The  FCT  to  FCC+FCT  phase  transformation  with  temperature  increase 
was  nearly  identical  to  the  original  test,  indicating  that  the  change  was  reversible.  Thus,  even 
though  the  film  was  transformed  into  the  high  temperature  FCC  structure,  it  reverted  to  its  original 
non-equilibrium  condition  upon  cooling. 


Two  Theta(Degree) 


Fig.  2.15 


Fig.  2.16 


Usually,  a  temperature  interval  for  a  complete  phase  transformation  of  an  In-Tl  alloy  is  less 
than  10°  C  [40],  Fig.  2.16(a)  shows  the  phase  transformation  of  a  thermally  prepared  22%T1  alloy 
in  which  the  transformation  starts  at  around  50°  C  and  is  nearly  complete  at  55°  C.  Fig.  2.16(b) 
shows  the  change  in  structure  of  a  electrodeposited  24%T1  alloy  deposit  upon  heating.  The  amount 
of  austenite  in  the  deposit  was  calculated  from  the  intensities  of  X-ray  diffraction  data  such  as  Fig. 
2. 15.  The  alloy  is  FCT  at  room  temperature  and  some  FCC  begins  to  appear  at  about  60°  C.  Even 
at  140°  C,  there  are  some  martensite  phase  remaining  in  the  alloy.  It  seems  that  the  transformation  is 
not  complete  even  over  this  wide  temperature  interval.  The  same  transformation  behavior  was 
shown  in  a  2 1  %TI  alloy. 


The  results  for  the  thermal  alloy  eliminate  the  possibility  that  the  sluggishness  in 
transformation  exhibited  by  the  electrodeposited  alloys  might  be  due  to  the  phase  stability  [41] 
caused  by  the  experimental  procedure  or  the  method  of  obtaining  the  data.  The  X-ray  data  were 
obtained  at  several  temperatures  and  the  sample  had  to  be  held  for  several  minutes  at  each 
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temperature  during  the  determination.  Another  possible  reason  for  the  slowness  in  the 
transformation  kinetics  would  be  non-uniformity  in  the  composition  of  the  deposit,  because  *he 
transformation  temperature  is  quite  sensitive  to  the  composition.  Therefore,  if  the  alloy  has  a  non- 
uniform  composition,  the  transformation  of  low  TI  content  alloys  starts  when  the  transformation  of 
high  7 1  content  alloys  is  already  complete.  Fig.  2. 17  shows  the  compositional  variation  obtained  by 
AES  throughout  the  deposit.  The  composition  of  the  electrodeposited  alloy  is  fairly  uniform  thus 
indicating  that  compositional  variations  are  not  the  reason  for  the  observed  effects.  The 
transformation  temperature  (As;  austenite  start)  of  a  24  %TI  allo\  is  supposed  to  be  around  O  C. 
however,  the  As  temperature  of  the  electrodeposited  alloy  deposit  shown  in  Fig.  2.16  is  about  60° 
C,  which  is  much  higher  than  for  thermal  alloys  of  the  same  composition.  There  have  been  many 
reports  on  the  decrease  in  Ms  (martensite  start)  temperature  due  to  grain  refinement  [69-7 1  ].  Thus, 
this  increase  in  As  temperature  may  result  from  the  fine  grain  structure  of  the  electrodeposited  alloy. 
Fine  grained  materials  with  more  grain  boundary  barriers  per  unit  volume  tend  to  inhibit  the 
transformation  [69],  therefore,  an  increase  in  temperature  is  needed  to  initiate  the  transformation. 
The  micrographs  of  the  alloy  deposit  shown  in  Fig.  2.18  indicate  a  fine  grain  structure  with  sizes 
ranging  from  several  pm  to  several  tens  of  pm.  The  composition  across  the  grains  was  fairly 
uniform,  as  determined  by  EDS  analysis,  so  grain  boundary  segregation  does  not,  appear  to  be  a 


problem. 


Fig.  2.17 


Fig.  2.18 
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Effect  of  Heat  Treatment 


After  X-ray  analysis  in  the  as-deposited  condition,  selected  samples  of  electrodeposited 
alloys  were  annealed  in  a  tube  furnace  at  140°  C  for  72  hours.  The  effect  of  annealing  on  the  phase 
transformation  temperature  is  given  in  Table  2.8. 

Table  2.8.  Relationship  Between  Composition  and  Phase  Structure  of  Annealed  Alloys 


T1  Content  Phase  Structure 


(at%) 

140°  CflEquil.) 

15.0 

all  FCTfall  FCT) 

all  FCT(all  FCT) 

20.0 

all  FCT(all  FCT) 

all  FCTfall  FCC) 

21.0 

all  FCTfall  FCT) 

20%  FCCfall  FCC) 

24.0 

all  FCT(all  FCC) 

53%  FCCfall  FCC) 

28.0 

all  FCC(all  FCC) 

all  FCC(all  FCC) 

38.0 

all  FCCfall  FCC) 

all  FCCfall  FCC) 

Equilibrium  structures  are  indicated  in  parentheses. 


In  essence,  annealing  did  not  dramatically  affect  the  transfoimation  behavior  of  the  electrodeposited 
alloys.  One  change  occurring  after  heat  treatment  is  that  the  percentages  of  the  FCC  phase  for  the 
21%  and  24%TI  alloys  at  high  temperature  are  even  lower  than  the  as-deposited  alloys.  This  result 
is  not  normally  expected  because  if  any  change  occurs  the  heat  treatment  would  be  expected  to  shift 
the  alloys  toward  the  FCC  equilibrium  condition.  The  transformation  kinetics  remain  sluggish  after 
heat  treatment  as  indicated  in  Fig.  2.19.  The  data  show  that  the  heat  treatment  cycle  used  in  this 
study  is  not  enough  to  shift  the  electrodeposited  alloys  from  a  non-equilibrium  to  an  equilibrium 
state.  Possibly  a  slightly  higher  temperature  and  longer  time  or  some  cold  working  prior  to  neat 
treatment  may  be  necessary. 
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Thallium  Content  (at%) 


Fig.  2.19 


Fig.  2.20 


Fig.  2.20  shows  the  difference  in  transformation  temperature  between  thermal  alloys  [34,42] 
and  electrodeposited  alloys  with  the  transformation  temperature  being  higher  for  alloys  produced  by 
electrodeposition.  The  relationship  between  the  thallium  content  and  the  c/a  ratios  of  FCT  phases  is 
given  in  Fig.  2.21.  The  c/a  ratio  of  electrodeposited  alloys  at  25°  C  is  a  bit  higher  than  those  of 
thermally  prepared  alloys  and  the  ratio  decreases  as  the  temperature  increases,  indicating  the  alloy  is 
transforming  from  tetragonal  to  cubic.  This  higher  degree  of  tetragonality  in  the  electrodeposited 
alloys  might  partially  explain  the  higher  transformation  temperature.  Also  there  is  no  major 
difference  in  c/a  ratio  between  as-deposited  alloys  and  annealed  alloys. 


a)  Annealed,  25°C 

b)  As-deposited,  25°C  \  -i 

c)  Pollock,  25°C[8]  \  ' 

d)  Annealed,  1 40°C  ** 

e)  As-deposited,  140°C 

f)  Thermally  prepared,  25°C 


Thallium  Content  (at%) 
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Also,  the  internal  stress  level  of  the  electrodeposited  alloys  m  .y  be  relatively  high  due  to  the 
nature  of  the  electrocrystallization  process,  absorption  of  impurities  or  organic  matter,  solubility  of 
hydrogen  or  similar  defects  generated  during  electrolysis.  Such  stress  levels  could  be  responsible 
for  the  transformation  behavior  of  the  electrodeposited  alloys  and  deviation  from  equilibrium  noted 
in  the  results. 


Shape  Recovery  in  Electrodeposited  in  In-Tl  Alloys 

Some  qualitative  tests  were  made  to  see  if  the  electrodeposited  alloys  exhibits  the  shape 
memory  effect.  An  electrodeposited  alloy  was  cut  into  a  narrow  strip  and  bent  into  a  ring  shape  at 
room  temperature.  When  the  sheet  was  heated,  movement  toward  its  original  shape  was  observed 
showing  at  least  a  partial  recovery  and  the  shape  memory  effect.  The  shape  recovery  was  confirmed 
for  the  21,  24,  and  28%T1  alloys.  The  deformation-recovery  test  was  repeated  several  times  and 
recovery  was  reproduced  in  every  instance.  Additional,  more  quantitative  evaluations  will  be 
necessary  to  more  completely  characterize  the  electrodeposited  alloys. 

2.4.3  Summary 

The  shape  memory  alloys  deposited  electrolytically  exhibit  a  substantially  different 
composition-phase  relationship  than  indicated  by  alloys  in  an  equilibrium  condition.  The 
electrodeposited  alloys  having  thallium  contents  between  21-28  at%  show  a  martensite  to  parent 
phase  transformation,  FCT  to  FCC  upon  heating.  The  transformation  temperatures  of  the 
electrodeposited  alloys  are  much  higher  than  those  of  the  thermally  prepared  alloys.  The  temperature 
interval  for  the  transformation  of  the  electrodeposited  alloys  is  much  greater  than  those  of  the 
thermally  prepared  alloys.  The  electrodeposited  alloys  having  FCT  phases  at  25°  C  show  higher  c/a 
ratios  compared  with  the  thermally  prepared  alloys.  The  deformed  21-28  at%Tl  electrodeposited 
alloys  at  25°  C  show  a  recovery  to  their  original  shape  upon  heating,  a  response  indicating  that  it  is 
possible  to  produce  shape  memory  alloys  by  electrolysis. 
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3.  Description  of  Test  Articles  and  Associated  Electronic  Circuits 


To  demonstrate  some  of  the  capabilities  of  smart  structures  and  to  determine  the  limiations  imposed  by 
hardware  realization,  we  have  designed  and  fabricated  experimental  test  articles  incorporating  flexible  struc¬ 
tures  with  shape  memory  actuators,  strain  gauge  sensors,  signal  processing  circuits  and  digital  controllers 
Two  flexible  mechanical  systems  were  designed  an  fabricated  in  our  laboratory.  The  first  was  an  aluminum 
cantilever  beam  for  which  the  free-vibration  damping  of  the  fundamental  mode  was  enhanced.  This  sy  stem  is 
a  single-input,  single-output  system  with  one  actuator  and  sensor.  The  second  system  studied  was  a  three- 
mass  system  with  its  three  natural  frequencies  below  3.2  hz.  This  system  constitutes  a  multi-input,  multi-out¬ 
put  system  with  two  actuators  and  two  sensors.  The  electronic  circuits  for  signal  processing,  SMA  drivers 
and  data  acquisition  systems  have  been  designed  fabricated  an  mounted  on  the  test  articles.  A  brief  descrip¬ 
tion  of  the  test  articles  and  associated  electronic  circuits  is  presented  in  this  section. 

The  single-input  single-output  test  article  used  was  a  cantilever  beam  made  of  aluminium  with  shape 
memory  actuators  and  strain  gauge  sensors.  The  sensing  system  consisted  of  four  strain  gauges  placed  at  the 
fixed  end  of  the  cantilever  beam  and  connected  in  a  full  bridge  configuration.  Actuation  of  the  structure  is 
accomplished  with  a  set  of  NiTiNOL  wires  with  driver  circuits  to  provide  heating.  A  tip  mass  was  added  to 
the  beam  to  lower  the  frequenc  y  of  the  first  natural  mode  to  within  the  bandwidth  of  the  shape  memory  actu¬ 
ators.  The  top  view  of  the  test  article  is  below: 


Strain  gauge  amplifier  circuit 


Swing  Path  of  the  beam 
NiTiNOL  Wires  (actuator) 

Beam 

r 

Tip  Mass 


Strain  gauges  affixed  to  the 
beam  (sensor) 


NiTiNOL  wire  driver  circuits 


Plexiglass  posts 
holding  the  NiTiNOL 
wires 


Figure  3.1.  Top  view  of  the  cantilever  beam  test  article 

Another  flexible  structure  was  designed  for  use  as  a  multi-input/multi-output  system.  It  consists  of  three 
steel  masses  suspended  vertically  by  thin  aluminum  struts.  The  masses  were  chosen  such  that  the  structure 
would  have  multiple  natural  frequencies  within  the  limited  bandwidth  of  the  NiTiNOL  wire  actuators.  This 
test  article  is  depicted  in  Fig.  3.2. 
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Two  sets  of  full  bridge  strain  gauges  are  bonded  to  the  struts  at  the  locations  indicated  in  Fig.  3.2.  The 
signal  conditioning  electronics  for  each  bridge  are  mounted  on  the  closest  mass  and  as  close  to  the  bridge  as 
possible  to  minimize  measurement  noise.  Actuation  of  the  structure  is  accomplished  with  two  sets  of  NiTi- 
NOL  wires  with  drivers  to  provide  heating.  One  set  of  wires,  arranged  vertically  along  each  strut,  applies  a 
moment  at  the  bottom  of  the  structure.  A  second  set  of  wires,  connected  diagonally  from  the  top  of  one  strut 
to  a  point  on  the  opposite  strut  just  above  the  top  mass,  exerts  a  force  on  the  structure. 

The  test  structure  can  be  excited  with  a  wide  range  of  initial  conditions.  Large  amplitude  initial  condi¬ 
tions  can  permanently  deform  the  NiTiNOL  wire.  This  is  seen  as  a  loss  of  pretension  of  the  actuator  after  the 
controller  has  suppressed  vibrations  in  the  beam  and  it  is  once  again  at  rest.  This  deformation  of  the  SMA 
actuators  results  in  system  parameter  variations.  The  experimental  data  presented  in  this  paper  is  the 
response  with  the  beam  excited  in  the  first  mode  and  the  bottom  mass  displaced  by  one  inch.  This  moderate 
amplitude  initial  condition  was  chosen  to  limit  the  permanent  deformation  of  the  force  SMA  actuators. 


*•>*  — . 
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Figure  3.2.  3-Mass  System  Test  Article 


3.1.  NitiNOL  wire  Actuation 

Each  shape  memory  alloy  actuator  was  a  pair  of  uncoated,  trained,  9  mil  NiTiNOL  wires  with  a  transi¬ 
tion  temperature  of  35°C.  The  control  force  was  generated  by  heating  the  wire  with  a  current  which  causes 
the  wire  to  contract  due  to  the  shape  memory  effect.  To  maximize  the  usable  bandwidth  of  the  actuator, 
forced  air  cooling  was  used  on  the  cantilever  beam  experiment  to  reduce  the  cooling  time  constant  of  the 
wire.  The  dynamics  of  the  actuator  were  experimentally  determined  to  be  described  by  a  first  order  low  pass 
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system  with  an  effective  time  constant  of  2.6  sec.  The  static  input-output  characteristic  of  the  wire  is  shown 
in  Fig.  3.3.  The  NiTtNOL  wire  drivers  were  voltage-controlled  current  amplifiers  as  shown  in  Fig.  3.4 
designed  to  drive  the  wires  in  their  linear  range  as  shown  in  Fig.  3.4.  One  driver  was  used  for  each  NiTtNOL 
wire. 


Current  (mA) 

Figure  3.3.  NiTtNOL  wire  dynamics 


Inverter  Rectifier  +  dc  offset  cancellation  Amplifier 


Stage 


Figure  3.4.  NiTtNOL  wire  driver  circuit  diagram 
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The  first  part  of  the  driver  rectifies  the  incoming  sinusoidal  voltage.  The  0.7  volt  d.c  offset  generated  by 
the  rectifying  diode  is  cancelled  by  the  next  stage.  This  half  wave  rectified  signal  with  zero  d.c.  offset  is 
amplified  to  a  certain  voltage  before  it  is  used  to  drive  the  darlingtion  pair  which  in  turn  passes  a  current 
through  the  NiTiNOL  wire. 

The  NiTiNOL  wire  used  is  uncoated,  pre-trained,  nine  mil  wire  with  a  transition  temperature  of  35°C. 
Each  wire  is  clamped  to  the  structure  with  an  initial  tension  of  0.S  lbs.  The  tension  in  each  wire  is  measured 
with  a  load  cell  mounted  in  line  with  the  wire.  The  wires  are  driven  such  that  the  current  supplied  for  one 
wire  is  180°  out  of  phase  with  the  current  of  the  other  driver  in  the  pair.  Each  driver  circuit  has  adjustments 
for  gain,  bias  and  dead-band.  A  small  dead  band  is  allowed  in  the  driver  circuit  so  that  current  is  never 
applied  simultaneously  to  both  wires. 

A  second  component  of  the  dead  band  is  the  SMA  actuator  itself.  Since  the  actuation  force  is  a  function 
of  the  temperature  of  the  SMA  wire,  small  amounts  of  current  flow  thought  the  wire  with  no  resulting  force. 

The  relation  between  the  EXT  voltage  applied  to  the  driver  and  the  resulting  force  in  one  of  the  moment 
actuators  is  shown  in  Fig.  3.5.  From  Fig.  3.5,  it  can  be  seen  that  the  deadband  of  the  SMA  actuator  for  the 
DC  input  is  approximately  1 .6  volts.  The  actual  deadband  during  operation  of  the  system  will  vary  from  the 
DC  value  due  to  the  heating  and  cooling  of  the  NiTiNOL  actuator.  The  dead-band  has  the  benefit  of  increas¬ 
ing  the  noise  immunity  of  the  actuator. 


DC  Volts 

Figure  3.5.  NiTiNOL  Force  Characteristics 
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3.2.  Strain  Gauge  Sensing 

The  strain  gauge  bridge  was  powered  by  a  regulated  10V  supply  and  the  output  was  amplified  by  an 
instrumentation  amplifier  with  a  gain  of  approximately  110.  Assuming  the  strain  in  the  beam  at  the  fixed  end 
is  proportional  to  the  tip  displacement,  the  sensor  voltage  is  proportional  to  the  tip  displacement. 

The  bridge  configuration  of  the  four  strain  gauges  and  the  difference  amplifier  circuit  is  as  shown  in  the 
following  circuit. 


Differential  amplifier' 
Figure  3.6.  Sensor  circuit  diagram 


The  differential  voltage  out  of  the  strain  gauge  bridge  appears  at  points  VI  and  V2.  This  voltage  is 
amplified  by  the  differential  instrumentation  amplifier  to  generate  a  single  ended  voltage  that  is  proportional 
to  the  (VI -V2). 

33.  Controller  Implementation 

For  purposes  of  implementing  the  digital  controllers,  the  strain  gauge  electronic  circuits  and  the  driver 
circuits  are  interfaced  to  a  personal  computer.  For  the  cantilever  beam  test  article,  a  Keithley-Metrabyte 
DAS-20  interface  board  was  used  for  the  A/D  and  D/A  conversions.  For  the  3-mass  system,  a  Keithley- 
Metrabyte  DAS-20  A/D  board  with  a  SSH-4  simultaneous  sample  and  hold  interface  was  used  for  the  A/D 
conversions.  This  allows  the  strain  gauge  data  collection  for  control  as  well  as  the  collection  of  other  data  of 
interest  such  as  wire  load  cell  information.  The  driver  circuits  on  the  3-mass  system  are  interfaced  to  the  per¬ 
sonal  computer  through  a  Keithley-Metrabyte  DDA-06  simultaneous  sample  and  hold  board. 
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The  control  block  diagram  representation  of  the  test  articles  is  as  shown  below. 


Figure  3.7.  Block  diagram  representation  of  the  test  article 


The  physical  implementation  of  the  system  is  represented  in  the  following  diagram. 


386-33  PC 


Figure  3.8.  Block  Diagram  of  Experimental  Setup 
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4.  Structural  Modelling  Techniques 


The  objective  of  this  study  was  to  show  the  effectiveness  of  robust  control  on  structural  behavior  of 
smart  structures.  Most  of  these  techniques  rely  on  an  accurate  knowledge  of  the  systems  modal  characteris¬ 
tics.  A  mathematical  representation  of  a  system  can  be  obtained  in  two  different  ways.  The  first  method  is  to 
generate  a  model  based  on  the  various  laws  of  physics  and  the  knowledge  about  the  physical  construction  of 
the  system.  These  analytical  models  for  composite  structures  can  be  developed  by  using  finite  element  meth¬ 
ods.  The  second  method,  known  as  the  Structural  Identification  Technique,  utilizes  the  experimental  input- 
output  response  of  the  system  for  deriving  mathematical  models  of  the  structural  systems.  The  results  of 
structural  identification  are  utilized  in  validating  the  finite  element  models.  Juang  and  Pappa  [92]  have 
developed  a  modal  parameter  identification  method,  called  the  Eigensystem  Realization  Algorithm  (ERA) 
for  determining  a  mathematical  model  of  a  structure  directly  from  experimental  data.  A  salient  feature  of  this 
method  is  that  it  allows  one  to  include  only  good  or  strongly  measured  signals  without  loosing  any  capabili¬ 
ties.  This  feature  of  selection  of  appropriate  data  minimizes  the  effects  of  measurement  noise  on  the  identi¬ 
fied  parameters  of  the  system.  The  ERA  method  is  used  for  determining  a  mathematical  model  of  the 
cantilever  beam  with  actuators  and  sensors. 

4.1.  The  Eigensystem  Realization  Algorithm 

The  Eigensystem  Realization  Algorithm  [92]  uses  the  Markov  parameters  of  a  system  to  identify  a  mini¬ 
mal  order  state  space  representation  of  the  system.  The  Markov  parameters  of  a  physical  system  Y(k)qm  (q  = 
#  of  outputs  of  the  system,  m  =  #  of  inputs)  are  used  to  form  a  rxs  block  size  Hankel  matrix. 


«„(*-!) 


yw  r(*+l) 

Hk+  1) 


|K(*  +  r-l)  ... 
The  singular  value  decomposition  of  Hr,  (0)  gives 


H„(0)  =  PDQt 


Y(k  +  s-  1) 

Y(k  +  r+s-  2)_ 


0) 


(2) 


where  P  and  Q  are  nonsingular  matrices  and  D  is  a  diagonal  matrix  of  singular  values.  If  there  are  n  non¬ 
zero  singular  values  in  D  then  the  matrices  P,  D  and  Q  can  be  truncated  to  a  rqxn  sized  P,,  a  n  x  n  sized  D, 
and  a  nx ms  sized  £?, .  Then 

Hr,(0)  =  P.xD.xQf  (3) 


Here  n  becomes  the  order  of  the  identified  system. 

Define  two  matrices  £,  =  [/,.0,. . ...0,]  and  £„  =  [/m, 0„,  ...,0Jr  where  Iq  =  qxq  identity  matrix  and 
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0,  *  q  x  q  zero  matrix.  Then  the  discrete  state  space  matrices  of  the  system  are  given  by 


A  *  D'XW2P\H„ ( 1 ) QlD'iW1 

B  =  D\,2Q\Em  (4) 

C  =  E[PtD\/2 


Then  the  system  equations  are 

x(*  +  1)  =  Ax(k)+Bu(k) 

(5) 

y(*)=Cx(*) 

4 2.  Observer  Method  of  Calculating  the  Markov  Parameters  of  the  System 

For  flexible  structures,  it  is  very  difficult  to  generate  and  accurately  record  the  Markov  parameters  due  to 
the  small  magnitude  and  low  damping  of  a  typical  pulse  response.  The  difficulty  is  compounded  for  MIMO 
systems.  Hence,  asymptotically  stable  observer  formulation  [93]  as  outlined  below  is  used  to  calculate  the 
Markov  parameters  of  a  system  from  its  response  to  a  random  input. 

The  state  variable  equations  for  a  system  with  an  observer  can  be  written  as 

S(*+l)  =AX(k)+Bv(k) 

y(k)  =  CJt(k)  +Du(k)  ° 


where 


A  =  A +MC 
B  =  [B  +  MD.-M] 


v  (*)  * 


u(k) 

_y(k)_ 


(7) 


The  matrix  M  is  chosen  such  that  the  eigenvalues  of  A  are  placed  at  any  desired  location.  The  eigenval¬ 
ues  of  A  can  be  chosen  to  be  either  real  or  complex  conjugate  pairs.  An  outline  of  the  formulation  to  calcu¬ 
late  the  system  Markov  parameters  with  real  eigenvalue  assignment  of  the  observer  is  described  below.  The 
formulation  for  complex  eigenvalue  assignment  and  a  combination  of  real  and  complex  eigenvalue  assign¬ 
ment  follows  similar  steps. 

The  input-output  relation  of  the  above  observer  system  can  be  written  as 
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(8) 


f-\  p-\ 

y{k)  =  +pXX»»>(*-x-l) +D«(*)  k*p 

t  *  0  t  »  0 


where  fit,  {J,  D  are  the  unknown  observer  parameters  and  X’s  are  given  by 


K"=IKIK'L . KV 

=  . KV 

Kl  =  *°g(K' . K")m„ 


Eq.(8)  can  be  written  as 


where 


y(k)  =yT(k-i) 

?  =  [ct,  §,  r>] 

y  =  matrix  of  unknown  observer  parameters 


The  unknown  matrix  of  observer  parameters  can  be  calculated  recursively6  using  the  relation 


f(0  =  t(«-l)+A?(i-l) 


where. 


A?(/-l)  = 


9?(«-2)r(«-n  iy(o-}a- i)r(«-i)i 
i+r(«-i)T,3i(i-2)r(«-i) 


T-\T 


51  (i- 2)  =  51  (i-  1)  - 


The  initial  value  of  ? (i)  and  51  (i  -  2)  are 


K(i-2)r(«-Dr(i-l)r*(«-2) 

l+r(i-l)r9l(i-2)r(«-l) 


f(0  x  Imu+Itf +m| 

^(*~2)  —  10000  x  /(n**^**)  x  i  ««,+!,»+■) 


(9) 


(10) 


(ID 


(12) 


(13) 


(14) 


Once  the  observer  parameters  are  calculated  using  Eq.(ll),  Eq.(12)  and  Eq.(13),  the  Markov  parameters 
of  the  system  can  be  calculated  using  the  following  relation 


it  =  aK',+pflK0y<-i-i+K,D 

U»o 


(15) 
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4.3.  The  MDVV  Linear  System  Identification  Algorithm 

This  algorithm  to  identify  a  linear  state  space  model  of  a  system  from  experimental  test  data  was  sug¬ 
gested  by  Moonen,  DeMoor,  Vandenberghe  and  Vandewalle  (MDVV).  [94] 

The  MDVV  algorithm  is  based  on  the  system  identification  method  using  the  canonical  variate  analysis 
by  Larimore.  [95 J  The  algorithm  is  a  2  step  procedure  to  get  a  minimal  order  state  space  realization  of  a  sys¬ 
tem  from  experimental  random  input  output  data.  First,  a  state  vector  sequence  is  realized  as  the  intersection 
of  the  row  spaces  of  two  block  Hankel  matrices  constructed  using  the  input  output  data.  Then  the  system 
matrices  are  obtained  from  the  least-squares  solution  of  a  set  of  linear  equations  involving  the  states  calcu¬ 
lated  in  the  first  step  and  the  input  output  data.  The  steps  involved  in  the  MDVV  algorithm  are  as  follows 

Step  1 .  Form  Hl ,  H2  and  H  as  shown 


-  u(k) 

u(Jfc+  1) 

u(k+j-  1)  - 

y(k) 

y(k+ 1) 

V 

/-v 

*r- 

+ 

1 

u(k+  1) 

u(k  +  2) 

u(k+j ) 

y(k+ 1) 

y(k+ 2) 

y(k+j) 

(16) 

U  (Jfc  +  1  -  1) 

u(k  +  i) 

u  (k  +;'  + 1  -  2) 

_y(k  +  i-  1) 

y(k  +  i) 

y  (k  +j + 1  -  2)  _ 

u(k  +  i) 

u(k  +  i+  1) 

u(k  +  i+j-  1)  * 

y(k  +  i) 

y(k  +  i+  1) 

y(k  +  i+j-  1) 

u(k  +  i+  1) 

«(Jfc  +  l'  +  2) 

u(k+i+j) 

y(k  +  i  +  i) 

y(Jfc  +  t  +  2) 

y(k  +  i+j) 

(17) 

u(k  +  2i  -  1) 

u  (k  +  2i) 

u(k+j  +  2i -  2) 

y(k+2i-l) 

y(k  +  2i) 

y(k+j  +  2i-2)  _ 

r»ii 

(18) 

where  u  (k)  and  y(k)  are  m  and  q  dimensional  vectors  of  system  input  and  output  respectively. 
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Step  2.  Form  the  singular  value  decomposition  of  H 


H  =  USVT 


eT 

1 - 

_ 1 

rsn 

O' 

l - 

S? 

^22J 

0 

_ 1 

0. 

(19) 


here  the  sizes  of  the  matrices  are 

iize(Sjj)  =  (2 mi  +  n)  x  (2mi  +  n) 
size(Un)  «  ( mi  +  qi )  x  ( mi  +  qi ) 
size  (U 12 )  *  (mi  +  qi)  x  (2 qi-n) 
size  (U j j)  =  (mi  +  qi)  x  (2m/  +  n) 
size(U2 2)  «  (mi  +  qi)  x  (2 qi-n) 


T 

Step  3.  Form  the  singular  value  decomposition  of  the  product  ^12^1 1*^1 1  25  s^own 


=  [C/Xl 


rs, 

.0 


,Tl 


Vr± 
Lvq  J 


where  U '  is  a  (2qi  -n)  Xn  matrix. 

Step  4.  Then  the  identified  nth  order  model  is  given  by 


rA  si  = 

UqUy +  ^  +  (,+  1)  (m  +  q);:)S 

Lc  dJ 

_U (mi  +  qi  +  m+  1 :  (m  +  q)  (i+  1),  :)5_ 

UTqU\2U(\:(m  +  q)i,:)S 

JJ  (mi  +  qi+l  :mi  +  qi  +  m,: )  5. 

t 

(20) 


(21) 


(22) 


4.4.  Swept  Sinewave  Testing  Method 

This  is  a  conventional  method  to  obtain  a  transfer  function  model  of  a  system  from  experimental  test 
data.  In  this  method,  a  system  is  excited  using  sinusoidal  inputs  within  a  certain  frequency  range.  The  output 
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of  the  system  is  recorded  and  the  magnitude  and  phase  is  plotted  against  frequency.  A  curve  fitting  technique 
is  used  to  generate  a  transfer  function  model  of  the  system  from  the  recorded  frequency  response  plot  (bode 
plot).  The  whole  procedure  of  recording  the  data  followed  by  the  curve  fitting  analysis  was  automated  using 
the  HP  35665A  spectrum  analyzer. 

4.5.  Experimental  Results  on  the  Cantilever  Beam  Tfest  Article 

The  three  methods  viz.  the  Observer-ERA  method,  the  MDVV  method  and  the  swept  sinewave  testing 
method  were  used  to  generate  a  model  of  the  cantilever  beam.  For  the  Observer-ERA  and  the  MDVV  algo¬ 
rithm  the  experimental  data  used  was  the  response  of  the  beam  to  a  uniformly  distributed  random  input  for 
60  seconds  at  5  Hz  sampling  frequency.  In  the  Observer-ERA  algorithm,  various  sets  of  observer  poles  and 
window  size  were  selected  and  the  response  of  the  model  generated  using  each  of  the  sets  was  compared  to 
the  frequency  response  of  the  beam.  Finally,  we  selected  observer  poles  at  -  0.9  ± y'0.2,  -  0.8  ±  j0.3 1  and 
±0. 1  and  a  window  size  p  of  90.  The  frequency  as  well  as  the  time  response  of  the  model  generated  using 
this  set  matched  that  of  the  beam  over  the  frequency  range  of  interest  as  shown  in  Figures  4.1  and  4.2. 
Another  experimentally  derived  model  was  generated  using  a  HP35665A  spectrum  analyzer.  This  model  was 
produced  using  sinusoidal  inputs  between  0.1  Hz  and  2  Hz  and  the  curve  fitting  software  available  with  the 
analyzer.  This  model  was  called  the  swept  sine  model  (SSW).  The  third  model  was  obtained  using  the 
MDVV  algorithm  on  the  same  sequence  of  input  output  data  that  was  used  in  the  Observer-ERA  algorithm. 

For  comparison,  the  Markov  parameters  generated  by  the  Observer-ERA  method,  and  the  MDVV  algo¬ 
rithm  are  plotted  in  Fig.  4. 1  along  with  the  pulse  response  of  the  swept  sine  model.  For  clarity,  only  the  first 
20  Markov  parameters  have  been  shown.  The  symbols  on  the  plot  show  the  values  of  the  calculated  Markov 
parameters. 


Once  the  Markov  parameters  have  been  calculated,  the  ERA  can  be  used  to  identify  a  model  of  the  test 
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article.  The  discrete  time  model  (T  =  0.2  sec)  of  the  cantilever  beam  generated  using  the  Observer-ERA 
method  is  given  by 


<J>  3E 

-0.1956  -0.9409  0.1873_ 
0.9409  -0.3737  -0.08% 

II 

•2 

5 

CO. 

0.0366* 

-0.0416 

.0  1873  0  08%  0.5492_ 

.0.0311. 

C»b,  =  [0.0366  0.0416  0.03  ll] 


(23) 


The  equivalent  continuous  time  transfer  function  of  the  system  in  Eq.  (22)  is 


GobAs) 


-  0.0074J2  +  0.007 1  5  +  1 .0233 
53  +  2.8 153  +  88.98445  +  237.34% 


(24) 


In  comparison,  the  discrete  time  state  space  model  (T  =  0.2  sec)  of  the  cantilever  beam  generated  using 
the  MDVV  algorithm  is 


-0.1642 

-9.35%  -19.318 

*0.1869* 

<T>  = 

^ mdvv 

0.0836 

0.064  -1.9318 

0.0187 

.0.0084 

0.1064  0.8068. 

0.0017. 

C , !4vv  [-0.0008  -0.0544  -0.5853] 


(25) 


The  continuous  time  transfer  function  equivalent  of  Eq.  (24)  is 


G, ,<*,»  (s) 


0.01 52  -  0.00595-  1.3482 
53  +  2.730252  +  88.86545  +  23 1 . 1 2 


(26) 


Finally,  the  Swept  Sine  Model  is  given  by  Eq.  (26) 


C(5) 


_ L55 _ 

53+  1. 65025* +  88.9255  +  136.8651 


(27) 


The  comparison  of  the  frequency  response  of  the  three  models  in  Eqs.  (23),  (25)  and  (26)  and  the  mea¬ 
sured  frequency  response  of  the  beam  is  shown  in  Fig.  4.2. 
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Figure  4.2  shows  that  there  is  a  acceptable  match  between  the  frequency  of  the  first  mode  of  the  three 
models  and  the  measured  frequency  response  of  the  beam.  From  Fig.  4.1  and  4.2  we  can  conclude  that  any  of 
the  three  mathematical  models  obtained  is  a  plausible  structural  model  of  the  cantilever  beam  system. 

4.6.  Experimental  Results  on  the  Three  Mass  System 

An  experimental  model  was  obtained  for  the  test  article  from  frequency  response  data  acquired  with  a 
HP3S66SA  dynamic  signal  analyzer.  Four  transfer  functions  were  obtained  for  the  two  input-two  output  sys¬ 
tem.  For  each  measurement,  the  analyzer’s  source  sweeps  the  desired  frequencies  in  a  series  of  very  small 
steps.  The  input  signal  power  is  automatically  adjusted  at  each  frequency  in  the  sweep  to  provide  the  optimal 
signal  to  noise  ratio.  An  advantage  of  this  swept  sine  measurement  is  that  all  of  the  energy  in  the  frequency 
response  test  is  concentrated  at  the  desired  frequency  which  gives  good  excitation  of  the  structural  system 
[96,97,98].  Modal  frequencies  taken  from  the  bode  plots  of  the  system  are  listed  in  Table  (4.8).  From  this 
table  it  can  be  seen  that  the  frequency  of  the  modes  is  a  function  of  the  actuator  used  to  excite  the  beam.  The 
frequency  of  the  modes  is  also  dependent  upon  the  initial  tension  in  the  SMA  wires. 


Table  4.8. 3-Mass  System  Modal  Frequencies 


Mode 

Experimental  •  Force  Actuators  (Hz) 

Experimental  •  Moment  Actuators  (Hz) 

1 

0.96 

0.90 

2 

2.54 

2.37 

3 

3.31 

3.09 

For  each  of  the  four  transfer  functions,  a  model  was  generated  by  the  HP  analyzer’s  built-in  curve  fitting 
program.  The  curve  fitting  program  derives  a  linear  system  model  from  the  measured  frequency  response 
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data.  A  pole/zero  model  is  developed  by  calculating  a  weighted  least  squares  fit  of  the  frequency  response 
data  to  a  rational  polynomial.  The  curve  fitting  is  performed  in  the  s-domain  so  that  the  program  determines 
the  transfer  function.  Due  to  control  structure  interaction,  each  actuator  generates  two  8th  order  transfer 
functions  with  similar  complex  poles.  A  common  denominator  for  all  4  transfer  functions  was  then  generated 
by  compromising  between  the  two  sets  of  frequencies.  [99]  The  measured  transfer  functions  and  the  result¬ 
ing  “fitted”  curves  are  shown  in  Fig.  4.3. 


Frequency  (Hz)  Frequency  (Hz) 

Sensor  1  Response  of  Force  Input  Sensor  1  Response  of  Moment  Input 


Frequency  (Hz)  Frequency  (Hz) 

Sensor  2  Response  of  Moment  Input  Sensor  2  Response  of  Moment  Input 

Figure  4.3.  Measured  Transfer  Functions  (Dashed)  and  Fitted  Curves  (Solid) 


The  corresponding  numerators  were  modified  so  that  the  four  transfer  functions  could  be  written  as 


where 


(28) 
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4(5)  =  5'  +  2.89j7  +  6.82  x  10V  +  1.77x  10V+1.21  x  10V  +  2.75x  10V  +  3.64x  10V  +  7.42x  10*5  +  3.95x  10‘ 
Nn  =  8.48  x  1(T'.S5  + 2.535* +  3.07  x  10V+  1.17  x  1035J+  1.32  x  10*5 +  7.68  x  10* 

Nn  —  -3.14  x  10"‘j3 -  1.64  x  lO^r2  - 2.42  x  1035 -  l.lOx  103 
N2l  =  2.18  x  10'V  +  4.59x  10*1j3  +  2.06j*-  1.11  x  10V  -  3.24  x  10V  -  3.19  x  10J5-  1.04  x  10J 
Nn  =  -  1.9655x  -9.85X5*  -5.31  x  10253  -  1.36  x  10V  -  1.12  x  1035  -  2.81  x  103 


4.7.  Determination  of  State  Variable  Representation  from  Transfer  Function  Matrix  [99,100] 

Even  thought  the  transfer  function  matrix  has  been  determined,  the  a  state  space  representation  is  needed 
for  the  model  so  that  LQG/LTR  controllers  can  be  designed.  The  desired  form  of  the  model  is  contained  in 
Eq.  (29). 


S  =  ApS  +  BPU 
9  =  CpX  +  Dpy 


(29) 


The  state  space  realization  will  be  a  controller-form  realization  from  a  right  matrix  fraction  description 
(MFD).  First,  the  TFM  must  be  in  the  form  of  a  strictly  proper  right  MFD  having  the  form  given  in  Eq.  (30). 


H(s)  =  N(s)D-l(s) 


(30) 


In  order  to  obtain  a  strictly  proper  right  MFD,  the  TFM  was  transformed  into  a  Smith  McMillian  form 
given  in  Eq.  (31). 


H(s)  = 


A  (5) 
d(s) 


(31) 


After  performing  pole/zero  cancellations  in  the  Smith  McMillan  form,  the  minimal  order  of  the  system 
was  determined  to  be  14.  Equation  (32)  contains  the  minimal  order,  column  reduced  transfer  function  matrix. 

W(5)  =  N,(s)D;'(s)  (32) 


Once  the  TFM  is  in  the  form  of  a  strictly  proper  right  MFD,  a  controller  form  state  space  realization  can 
be  obtained.  First,  rewrite  Dr(s)  as  shown  in  Eq.  (33). 

z?r(5)  =  D,rS(5) +0,^(5)  (33) 


where, 
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S(s)  «  diag  (A  A 


^(5) 


A*  A2  ...  j  1 


5**-'  5***1  ...  5  1 


and  kj,  k2  are  the  column  degrees  of  Dr(s) ,  D*C(J)  is  the  highest  column  degree  coefficient  matrix  of  Dr(s) 

Once  the  matrices  in  Eq.  (33)  are  determined,  the  state  space  realization  can  be  formed  using  the  matrix 
definitions  given  in  Eqs.  (34)  through  (36). 


=  block  diag( 


0  0  0 

1  ...  0 

0  1  0. 


kjXk,,  i  -  1,2  ) 


(34) 


(B°p) T  *  block  diag«l  0 . . .  0],  1  x  kt,  i  =  1, 2) 


(35) 


=  (36) 

Finally,  the  state  space  matrices  are  obtained  using  Eqs.  (37)  through  (39) 

Ap(j)  =  A0p-B^Dl\Du  (37) 

Bp  =  Bp;'  (38) 

C,  =  N,c  (39) 


where. 


Nr  =  NuV{s) 


4.8.  Formulation  of  Reduced  Order  State  Space  Model  [101] 

Next  to  reduce  the  order  of  the  controller  and  allow  for  the  modelling  and  control  of  complex  systems,  a 
reduced  order  model  was  developed.  The  reduced  order  model  of  the  system  was  acquired  utilizing  the  bal¬ 
ance  and  truncation  methodology.  First,  an  internally  balanced  representation  of  the  system  is  obtained  in 
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which  the  controllability  and  observability  Gramians  (given  in  Eq.  (40))  are  equal  and  diagonal. 


I  *  diag(Oita . . o.) 


(40) 


with 


(41) 


The  o,’s  are  Hankel  singular  values.  The  order  of  the  reduced  order  model  is  determined  by  retaining  the 
dominant  Hankel  singular  values.  Once  the  dominant  values  have  been  determined,  the  state  space  model  is 
partitioned  as  in  Eq.  (41) 


4 bal  &htl 
fbMl 


4|j 

^2* 

*1 

[c.e3  D 


(42) 


where  A,,  e  R'*r,  Au  e  r  is  the  number  of  dominated  Hankel  singular  values,  and  (n-r) 

is  the  number  of  nondominate  values. 


Finally,  after  the  partitioning  is  completed,  the  reduced  order  model  is  determined  as  in  Eq.  (43) 


Sr  =  A£r  +  Brg 
9,  -  C£r  +  DrU 


(43) 


where. 


Ar  =  A,(;  Br  —  Bx 

C,  =  C,  Dr  =  D  (44) 

£r  =  reduced  order  state  vector 

4.9.  Modelling  Results 

A  comparison  of  the  frequency  response  of  the  transfer  function  matrix  and  the  reduced  order  model  is 
given  in  Fig  4.4.  This  figure  shows  that  the  reduced  order  model  produces  a  close  representation  of  the 
response  of  the  transfer  function  matrix. 
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Sensor  1  Response  of  Force  Input 


Sensor  2  Response  of  Force  Input 


■o 

j 


CO 


! 


Sensor  1  Response  of  Moment  Input 


Frequency  (Hz) 


Sensor  2  Response  of  Moment  Input 


(solid  -  reduced  order  model,  dashed  -  TFM) 


Figure  4.4.  Comparison  of  Frequency  Response 


The  next  figure  shows  the  comparisons  of  the  frequency  response  of  the  models  obtained  using  the 
Observer-ERA  algorithm,  MDVV  algorithm  and  the  frequency  response  of  the  three  mass  system  recorded 
by  the  HP  35665A  spectrum  analyzer. 
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Figure  4.5.  Comparison  of  Frequency  Responses 

Figures  4.4  and  4.5  shows  that  the  state  space  models  obtained  from  the  Observer-ERA,  the  MDVV 
algorithm  and  the  swept  sine  testing  method  give  a  fairly  good  representation  of  the  three  mass  structure. 
Therefore  any  of  the  above  models  can  be  used  to  design  conventional  as  well  as  robust  controllers. 
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5.0  Controller  Design  Methodologies 


The  design  and  implementation  of  control  strategies  for  large,  flexible  smart  structures  presents 
challenging  problems.  To  demonstrate  the  capabilities  of  shape-memory-alloy  actuators,  we  have 
designed  and  fabricated  a  three-mass  test  article  with  multiple  shape-memory-alloy  (NiTiNOL) 
actuators.  The  force  and  moment  actuators  were  implemented  on  the  structure  to  examine  the  effects  of 
control  structure  interaction  and  to  increase  actuation  force.  These  SMA  actuators  exhibit  nonlinear 
effects  due  to  deadband  and  saturation.  The  first  step  in  the  modeling  process  was  the  experimental 
determination  of  the  transfer  function  matrix  derived  from  frequency  response  data.  A  minimal  state 
space  representation  was  determined  based  on  this  transfer  function  matrix.  Finally  in  order  to  reduce 
the  order  of  the  controller,  a  reduced  order  state  space  model  was  derived  from  the  minimal  state  space 
representation.  The  simplified  analytical  models  are  compared  with  models  developed  by  structural 
identification  techniques  based  on  vibration  test  data. 

From  the  reduced  order  model,  a  controller  was  designed  to  dampen  vibrations  in  the  test  bed. 
To  minimize  the  effects  of  uncertainties  on  the  closed-loop  system  performance  of  smart  structures,  a 
LQG/LTR  control  methodology  has  been  utilized.  An  initial  standard  LQG/LTR  controller  was 
designed;  however,  this  controller  could  not  achieve  the  desired  performance  robustness  due  to 
saturation  effects.  Therefore,  a  modified  LQG/LTR  design  methodology  was  implemented  to 
accommodate  for  the  limited  control  force  provided  by  the  actuators.  The  closed-loop  system  response 
of  the  single  input-single  output  and  multiple  input-multiple  output  (MIMO)  test  articles  with  robustness 
verification  have  been  experimentally  obtained  and  presented  in  the  sections.  The  modified  LQG/LTR 
controller  demonstrated  performance  and  stability  robustness  to  both  sensor  noise  and  parameter 
variations. 

5.1  Classical  Control  Design  Methods 

As  a  first  step  toward  the  design  of  closed  loop  controllers  for  smart  structures,  classical  control 
algorithms  have  been  designed  and  implemented  on  test  articles.  The  salient  results  are  presented  in 
this  section. 

5.1.1  Actuator  Dynamics 

The  first  task  was  to  model  the  Nitinol  wire.  The  wire  was  given  an  initial  3%  of  permanent 
strain,  mounted  in  a  test  fixture,  and  then  heated  past  the  phase  transition  temperature  via  electrical 
resistance  heating.  Figure  5.1  shows  a  typical  experimental  heating-cooling  cycle. 
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Figure  5.1.  Typical  heating-cooling  cycle  of  Nitinol  wire 

This  resulting  force-time  history  indicates  that  a  first-order  model  is  an  appropriate 
approximation,  in  the  form  of  Eq.  ( 1 ) 

xF+F=abP  (1) 

where  F  =  force,  x  =  time  constant,  ab  =  slope  of  force-vs-power  curve,  P  =  V2/R  =  power  supplied 
to  wire,  V  =  voltage,  and  R  =  electrical  resistance  of  the  wire.  Figure  5.2  shows  the  experimentally- 
derived  curve  of  t  vs.  P.  The  value  of  x  selected  was  x  =  5  sec  on  the  basis  that  typical  power  levels 
were  in  the  2-4  watt  range.  Figure  5.3  shows  a  typical  value  of  steady-state  force  vs.  power  having  a 
slope  of  about  1.25  lb/watt  for  a  42-inch  Nitinol  wire  having  an  initial  permanent  strain  of  3%. 
Equation  ( 1 )  gives  a  nonlinear  relation  between  force  and  voltage.  The  voltage  was  supplied  by  a 
power  driver  for  which  the  voltage  V  was  given  by  V  =  2.2  Vpc,  where  Vpc  was  the  output  of  the  PC 
which  was  used  to  generate  the  control  action.  Hence,  Eq.  (1)  has  the  form  (for  the  42-inch  wire 
which  had  a  resistance  of  18  Q) 

5F  +  F=0.336Vj.  (2) 


Time  Constant  (sec.) 


Fig  2.  Time  constants  of  3%-strained  Nitinol  Wire  Fig  3.  Force  curve  of  42"  wire 

(10  mils) 

Equation  (2)  is  a  nonlinear  equation  between  force  and  voltage.  Linearization  of  Eq.  (2)  about  a 
nominal  voltage  of  =  2.5v  leads  to 


5f+f=1.68Vpc 


(3) 


to  which  corresponds  the  transfer  function 


Ga(s)  = 


0.336 
s  +  0.2 


(4) 


When  tested  experimentally  only  3.75  lb  of  force  was  generated  when  the  D/A  port  of  the  PC  applied  5 
volts  to  the  wire.  This  led  to  a  modification  of  the  numerator  of  Ga(s)  so  that  Eq.  (4)  was  replaced  by 
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(5) 


Ga(s)  = 


s  +  0.2 


It  is  this  empirical  Ga(s)  which  was  used  to  represent  the  actuator  in  the  controller  design  for  the 
cantilever  beam.  Similar  reasoning  led  to 


Ga(s)  = 


s  +  0.2 


0.302 
s  +  0.2 


for  the  54"-wire 


for  the  24"-wire. 


These  two  latter  wires  were  used  in  the  three-mass  simulation  and  experiment. 

Another  experiment  performed  on  the  Nitinol  wire  had  to  do  with  how  the  generated  force  was 
affected  by  the  frequency  at  which  applied  voltage  was  turned  on  and  off.  As  this  frequency  was 
increased,  the  effective  force  that  could  be  generated  decreased,  mainly  because  there  was  not  sufficient 
time  for  the  wire  to  cool  before  it  was  reheated.  This  tended  to  inhibit  the  transition  back  and  forth 
between  the  martensite  and  austenite  phases.  In  the  experiment  no  artificial  means  were  used  to  cool 
the  wires,  such  as  blowing  air  over  the  wires  by  means  of  fans.  Table  5.1  shows  the  effect  of 
frequency  on  the  effective  force  that  could  be  generated  by  the  54"  Nitinol  wire.  Clearly,  its 
effectiveness  in  the  present  application  appears  to  be  limited  to  systems  with  frequencies  below  2  Hz. 
The  transfer  functions  in  Eqs.  (5)  and  (6)  do  produce  the  type  of  behavior  exhibited  in  Table  5. 1 . 

Table  5.1.  Effective  Force  of  Nitinol  Wire  (54  inch) 
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5.1.2  Cantilever  Beam 


Two  42-inch  Nitinol  wires  were  used  on  each  side  of  a  cantilevered  aluminum  beam  of  length 
47".  The  beam  was  6"  wide  and  0. 125-inch  thick  with  a  tip  mass  of  maximum  weight  6.44  lb  which 
was  used  to  vary  the  fundamental  frequency  of  the  beam.  Figure  5.4  shows  a  schematic  diagram  of  the 
experiment.  Two  Nitinol  wires  ran  the  length  of  the  beam  and  were  alternately  heated  and  cooled  to  act 
in  a  differential-type  actuator  mode.  A  strain  gage  mounted  at  the  root  of  the  beam  was  used  as  a 
sensor. 

5.1.3  Beam  Transfer  Function 

A  finite  element  model  was  formulated  for  the  cantilever  beam.  Three-element  and  10-element 
models  gave  virtually  identical  results  for  the  lower  frequencies.  Table  5.2  shows  a  comparison 
between  the  FE  calculations  and  experimental  results  for  three  different  tip  masses.  It  is  evident  that 
only  the  fundamental  mode  is  capable  of  being  affected  by  the  shape  memory  action  of  the  Nitinol  wire. 
Hence,  the  beam  experiment  was  limited  to  a  single-input/single-output  control  problem.  To  aid  in  the 
design  of  the  controller,  the  transfer  function  for  the  fundamental  mode  of  the  cantilever  beam  must  be 
formulated. 


Table  5.2.  Comparison  of  Natural  Frequency  (Hz) 


Tip  Mass 
(lb) 

Mode 

Experiment 

NASTRAN 

0.00 

1 

1.60 

1.75 

2 

10.8 

11.3 

3 

30.0 

32.0 

1.146 

1 

1.20 

1.17 

2 

8.80 

9.21 

3 

26.0 

27.7 

6.437 

1 

0.58 

0.62 

2 

7.20 

8.62 

23.2 

22.1 

The  differential  equation  (based  on  the  finite  element  model)  for  the  fundamental  mode  was 
computed  to  be 


q,  +  0.0235  q,  +  15.3lq]  =  1.684ma(t) 


(7) 
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where  qi  is  the  generalized  coordinate  for  the  fundamental  mode.  An  experimentally-measured 
damping  ratio  of  0.30%  has  been  added  to  the  equation.  On  the  right  side  of  Eq.  (7),  ma(t)  is  the 
moment  applied  by  the  Nitinol  actuators.  The  actuators  stand  off  the  neutral  axis  of  the  beam  by  1.5", 
and  the  effect  of  the  axially-directed  actuator  force  (maximum  of  about  5  lb)  is  neglected.  If  f(t)  is  the 
force  generated  by  the  Nitinol  wire,  then  ma(t)  =  1 .5  f(t).  Furthermore,  in  the  transformation  from 
global  coordinates  to  the  modal  coordinates  that  generated  Eq.  (7),  the  tip  displacement  yx  turned  out  to 
equal  qi ,  so  that  the  equation  for  the  beam  tip  displacement  is 

yx  +  0.0235  yT  +  15.31  yT  =  2.526f(t).  (8) 

This  tip  displacement  is  sensed  with  a  strain  gage  mounted  at  the  root  of  the  beam.  Hence,  the  voltage 
output  of  the  strain  gage  is  proportional  to  yv  and  is  given  by 

v  +  0.0235V  +  15.31v  =  2.526a  f(t)  =  af  f(t)  (9) 

where  af  =  2.526a,  and  a  is  the  constant  of  proportionality  between  the  strain  gage  output  and  the  tip 
displacement.  This  parameter  was  obtained  experimentally.  When  a  force  f(t)  =  2.7  lb  was  generated 
in  the  Nitinol  wire,  the  strain  gage  output  was  measured  at  0.01 1  volts.  Hence,  af  =  0.0624  volt/lb, 
and  Eq.  (9)  becomes 


v  +  0.0235V  +  15.3 1  v  =  0.0624f(t)  .  (10) 

Corresponding  to  Eq.  (10),  the  transfer  function  between  the  strain  gage  output  and  the  actuator  force  is 

r  ,cv _ 0-0624 

S2  +  0.0235s  +  15.31  ‘  {ll) 

Figure  5.5  shows  the  block  diagram  for  the  control  system,  including  the  Nitinol  actuator  and  beam 
dynamic  model. 

To  verify  Eq.  (11),  the  frequency  response  of  the  cantilever  beam  with  actuator  was  computed 
and  compared  with  experiment.  Figure  5.6  shows  a  Bode  plot  of  the  theoretical  model  and  Fig.  5.7  is 
a  Bode  plot  of  the  actual  system.  The  comparison  between  them  is  quite  acceptable.  It  is  noted  that  the 
phase  angle  is  -45°  at  a  frequency  of  0.2  rad/sec.  This  would  correspond  to  the  (s+0.2)  factor  in  the 
denominator  of  the  transfer  function  for  the  actuator. 
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Fig.  5.4  Experimental  Setup 
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Fig.  5.5  Block  Diagram  of  Cantilever  Beam 
with  Actuator 
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Fig.  5.6  Theoretical  Bode  plot 

Fig.  5.7  Experimental  Bode  plot 

Simulation 


Several  control  algorithms  were  employed  in  the  simulation  of  the  entire  system.  The  baseline 
was  an  open-loop  system  with  the  Nitinol  wires  attached,  but  inactive.  Then  an  "on-off'  control 
algorithm  was  examined,  as  well  as  proportional,  proportional-plus-integral,  and  robust.  The  robust 
controller  was  designed  using  standard  MATRIX*  software.  Figures  5.8  -  5.12  show  the  simulated 
open-loop  response,  and  responses  for  the  on-off,  P,  PI,  and  robust  controller,  respectively.  The 
gains  of  the  classical  P  and  PI  control  were  selected  L>y  vtjying  them  so  as  to  obtain  the  best 
performance  in  terms  of  smallest  decay  time,  without  going  unstable.  The  gains  used  were  Kp  =  -350 
for  P-control,  and  Kp  =  -200,  K,  =  100  for  PI  control.  A  Routh-Hurwitz  stability  analysis  for  the  P- 
controlled  system  indicated  stability  for  (-327)  <  Kp  <  38.  Because  of  small  differences  between  the 
simulation  and  the  actual  system,  the  lower  limit  on  the  experimental  Kp  was  actually  about  (-350) 
instead  of  the  theoretical  value  of  (-327).  In  both  simulation  and  experiment,  the  negative  gain  gave 
better  results  than  did  the  positive  gain. 


Experimental  results 


Figures  5.13  -  5.17  show  the  experimental  results  for  the  open-loop  response,  and  the  on-off, 
P,  PI,  and  robust  controllers,  respectively.  The  free  vibration,  in  all  cases,  was  induced  by  an  initial 
displacement  of  the  tip  of  the  beam.  The  tip  mass  was  at  maximum  value  of  6.44  lb  to  give  the  beam  a 
fundamental  frequency  of  0.623  Hz.  Comparisons  with  the  corresponding  simulation  curves  shows 
good  correlation.  This  is  summarized  in  Table  5.3,  where  (for  convenience)  the  time  required  to  reduce 
the  vibration  amplitude  to  10%  of  its  initial  value  is  used  as  a  measure  of  effectiveness  of  the  control 
algorithm.  The  corresponding  open-loop  decay  time  was  220  seconds. 


Table  5.3.  Summary  of  10%  Settling  Time 


Controller  Type 

■ 

hkSBHI 

Experimental  Results 
(sec) 

On-Off 

37 

38 

P 

22 

24' 

PI 

36 

34 

LOG/LTR 

17 

5.1,4  Three-Mass  System 

To  work  with  an  MIMO  case,  a  three-mass  system  was  constructed,  as  shown  in  Fig.  5.18. 
The  three  weights  were  W,  =  23.2  lb,  W2  =  19.3  lb,  and  W3  =  15.4  lb.  The  three  springs  had  spring 
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lengths  of  23",  19",  and  18",  respectively.  The  upper  spring  was  2"  x  0.090".  and  the  lower  two 
were  2"  x  0.080".  The  weights  were  steel;  the  springs,  aluminum.  Two  sets  of  Nitinol  wires  were 
used:  54-inch  wires  running  lengthwise,  and  24-inch  wires  cris  crossing  above  the  uppermost  mass, 
as  shown  in  Fig.  5.18.  Figure  5.18  also  shows  a  schematic  of  the  experimental  setup.  Two  full- 
bridge  1000-ohm  strain  gages  were  used  as  sensors  and  were  located  in  the  top  and  bottom  springs  as 
indicated  in  Fig.  5.18. 
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Fig.  5.8  Open-loop  simulation  with 
inactive  wires. 


Fig.  5.9  Simulation  of  On-Off  controller 
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Fig.  5.16  Experimental  response  with  Pi-controller 


Fig.  5.17  Experimental  response  with  LQG/LTR 
controller 


Fig.  5.18  Block  diagram  of  three-mass  system 
experimental  set-up. 


Fig.  5.19  Experimental  frequency  response  of 
3-mass  system. 


A  finite  element  model  with  15  nodes  and  45  degrees  of  freedom  was  used  to  model  the  three- 
mass  structure.  Table  5.4  shows  the  first  10  computed  natural  frequencies.  Only  the  first  three,  which 
involved  displacements  of  the  three  masses,  were  within  the  bandwidth  of  the  Nitinol  actuators. 
Higher  frequencies  corresponded  to  spring  resonances  wherein  the  mass  displacements  were  very 
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small,  if  not  zero.  Hence,  only  the  first  three  modes  were  used  in  formulating  the  state-variable  model. 
Figure  5. 19  shows  the  frequency  response  of  the  actual  structure,  as  determined  by  the  impact-hammer 
method.  The  three  peaks  occur  at  0.76,  2.00,  and  2.92  Hz,  respectively.  The  agreement  between 
theory  and  experiment  is  good.  The  stiffening  effects  of  the  relatively-large  static  weight  of  the  3 
masses  were  included  in  the  finite  element  calculations. 

Table  5.4.  MSC/NASTRAN  Output  for  Three-Mass  System 


FREQUENCY 

(Hz) 


GENERALIZED 
STIFFNESS  (Ib/in) 


35.450 

43.633 

43.782 

46.028 

46.213 

130.45 


0.0871 

0.1076 

0.0906 

0.0010 

0.0010 

0.0008 

0.0008 

0.0007 

0.0007 

0.0936 


51.812 

57.358 

58.228 

60.470 

61.524 

62859. 


The  equations  for  the  first  three  modes  are  given  by  (damping  with  C,  =  0.003  has  been  added 


to  the  FE  result) 


0.087  0.  0. 


0.  0.108  0. 


0.  0.  0.091  J  ^  q3 


1.9  0.  0.  |  f  q, 


+  0.  15.8  0. 


0.  0.  28.  J  ^  q3 


0.002  0.  0. 


q2  f  +  0.  0.0051  0. 


0.  0.  0.0072  J  v  q3 


-0.45  -0.017 


•1.0  0.108 


0.654  0.129 


where  qj  are  the  modal  generalized  coordinates,  f2  represents  the  effective  transverse  force  from  the  24" 
crisscrossed  Nitinol  wires,  and  m5  the  moment  supplied  by  the  longer  54"  actuators  (as  in  the  case  of 
cantilever  beam). 

Equation  (12)  can  be  put  in  state-variable  form 

{x}  =  [As]{x}  +  [Bs]{u} 

(13) 

{y}  =  [Cs)  {x} 
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by  defining  a  vector  { x  }T  =  [qi  q2  q3  qi  q2  qs].  The  first  of  Eq.  (13)  then  takes  the  form 


0  I 

0 

'  f:  ' 

{*}  = 

-M  *K  -M  ‘C 

{x}  + 

M‘BS 

< 

.  m5  . 

where  M,  C,  K  are  matrices  given  in  Eq.  (12).  The  output  {y }  in  Eq.  (13)  are  strain  gage  readings 

{ y}T  =[e1  e2  e3  ...]  (15) 

To  illustrate  how  these  are  computed,  consider  a  case  of  3  strain  gages  mounted  on  beam 
elements,  as  shown  in  Fig.  5.20. 


Fig.  5.20  Beam  Elements  with  Three  Strain  Gages 

For  a  beam  bending  element,  the  standard  expression  for  the  transverse  deflection  anywhere 
within  the  element  is 


y(x)  =  [Ni(x),  N2(x),  N3(x),  N4(x)]  [yj  Oj  y2Q2]T  (16) 

where  yi,  ©i,  y2,  are  deflections  and  rotations  at  the  left  and  right  ends  of  the  element,  respectively, 
and  Nj(t)  are  standard  beam-bending  shape  functions.  The  corresponding  strain  measured  by  a  strain 
gage  at  x  =  xs  is  then 


€(x,)=-Zy"(xs)=-Z[NiVl+N;9,+NJy2+N:e2]  (17) 

ft 

where  (•)  denotes  the  second  derivative  of  Nj  evaluated  at  x  =  xs,  and  z  is  the  distance  from  the 
neutral  axis  of  the  beam  element  to  its  outer  fibers  where  the  gage  is  mounted. 
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Now,  suppose  that  there  are  3  gages,  as  indicated  in  Fig.  5.20.  Then 


or 

But 


{y}=-[zN"]{y}  . 

{y}  =  [Ta][qi  q2q3]T 


(19) 

(20) 


where  [TJ  is  an  abbreviated  version  (the  upper  left  portion)  of  the  modal  matrix  used  to  transform  the 
original  differential  equations  to  modal  coordinates.  Also, 


Thus,  the  output  equation  is 


where 


(y}  =  (Cs]  {x} 
[Cs]  =  -[zN"j 


(22) 

(23) 


Equations  (14)  and  (23)  constitute  the  state- variable  model  for  the  three-mass  structure. 


This  state-variable  model  must  be  augmented  by  a  model  for  the  actuators.  This  can  be 
formulated  as  follows.  Corresponding  to  the  transfer  functions  for  the  crisscrossed  24"  wire  and  the 
lengthwise  54"  wire,  the  actuator  forces  are  represented  by 


fs 

■  -0.2  0 

fs 

'  0.27 

0  ' 

V|  ' 

>  _ 
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>  + 

- 

.  0  -0.2  . 

.  0 

0.302  . 

.  v2  . 

(24) 


where  fs  is  the  actuating  force  generated  by  voltage  v,  applied  to  the  24"  wire,  and  f, 
is  the  force  generated  by  voltage  v2  applied  to  the  54"  wire.  Equation  (24)  can  be  written  as 


{fa}=[Aal{fa}+[Ba]{v) 


(25) 


where  {fa}  =  [fs  f*]T  and  { v }  =  [vt  v2JT. 

These  actuator  forces  apply  a  transverse  force  f2  and  a  moment  m5  to  the  3-mass  structure, 

where 


f  fa  ■ 

.. 

sin©  0 

< 

fs 

l  m5  . 

0  a 

,  ft. 

(26) 


and  where  0  is  the  inclination  to  the  vertical  of  the  crossed  24"  wires,  and  a  =  1 .5"  is  the  stand-off 
moment  arm  for  the  lengthwise  54"  wire.  Thus, 


'  fa 
.  m5 


ICJ  {fa} 


(27) 


Equations  (25)  and  (27)  constitute  the  state-variable  model  for  the  actuator. 


Finally,  the  augmented  state-variable  model  including  structural  dynamics  and  actuator 
dynamics  is 
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As  BsCa 
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>  = 
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...  > 
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.  0  Aa  . 

l  fa  J 

v  3  J 

.  Ba  . 

(28) 
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Strain  Gage  Output 
(  No.  1,  mV  ) 


x 


(29) 


{y}  =  [Cs  !  0] 


It  is  Eqs.  (28)  and  (29)  which  are  used  in  the  simulation  of  the  3-mass  system. 

5.1.5  Comparison  Between  Simulation  and  Experiment 

System  verification 

The  step-response  method  was  used  to  verify  the  mathematical  model  of  Eqs.  (28)  and  (29).  A 
simple  code  in  the  C-language  was  written  to  send  a  constant  command  input  voltage  from  the  D/A  port 
to  the  servo-amplifier.  Figures  5.21  and  5.22  show  the  experimental  results.  Figure  5.21  shows  the 
response  of  the  first  sensor  to  a  5-volt  step-input  to  the  force  actuator  (24"  wire),  and  Fig.  5.22  shows 
the  response  of  the  second  sensor  when  a  step-input  voltage  of  (-2)  volts  was  applied  to  the  moment 
actuator  (54"  wire).  Figure  5.23  shows  the  corresponding  simulation  of  the  experiment.  The  dotted 
line  represents  the  output  from  the  second  sensor  to  a  (-2)  volt  moment  actuation  and  the  solid  line 
represents  the  first  sensor  output  to  a  5-volt  force  actuation.  The  comparison  is  fairly  good. 
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Fig.  5.21  Experimental  step  response  of  sensor  No.  1. 


Fig.  5.22  Experimental  step  response  of  sensor  No.  2. 
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Fig.  5.23  Simulated  step  responses  of  3-mass  system. 
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5.2.  Design  of  Robust  Regulators  with  Arbitrary  Eigenvalue  Assignment 

It  is  well  known  that  given  a  system  modeled  by 

x  =  Ax  +  Bu  (30) 


y  =  Cx 


(31) 


using  the  feedback  control 


u  -  -Kcx 


(32) 


yields  the  closed-loop  regulator 


x  =  ( A  -  BKC)  x  »  $x  (33) 

The  feedback  gain  matrix,  Kc,  may  be  chosen  to  yield  any  desired  set  of  eigenvalues.  [102]  In  a  MIMO 
system  the  choice  of  Kc  is  not  unique  and  the  available  degrees  of  freedom  can  be  used  to  choose  eigenvec¬ 
tors  as  well  as  to  specify  the  choice  of  eigenvalues.  [102]  It  is  also  known  that  in  order  to  make  the  perfor¬ 
mance  of  the  resulting  regulator  as  insensitive  as  possible  to  variations  in  model  parameters,  the 
eigenvectors  should  be  chosen  to  be  mutually  orthogonal  or  as  close  to  orthogonal  as  possible.  [102,103] 

Because  the  regulator  for  this  particular  situation  is  intended  to  provide  active  damping  for  the  2  input-2 
output  system,  the  closed-loop  pole  placements  were  chosen  solely  to  improve  (increase)  the  damping  ratio 
of  all  lightly  damped  system  poles  without  altering  the  undamped  natural  frequencies.  Using  the  eighth  order 
model  discussed  above,  the  performance  of  two  different  regulators  has  been  investigated.  The  pole  place¬ 
ment  of  these  regulators  is  as  shown  in  Table  (5.5)  below.  This  table  gives  the  pole  locations  in  the  s-plane  in 
order  to  aid  the  reader  in  visualizing  the  effect  that  the  regulator  has  on  the  pole  placement. 


Table  5.5.  Regulator  Pole  Locations 


Model  Poles 

For  Damping 
Ratio  £  »  0.05 

For  Damping 
Ratio  £  =  0.1 

-0.160  ±  20.109i 

20.109 

0.0080 

-1.005  1  20.081 

-2.01 11 20.000i 

-0.3141 15.383* 

15.392 

0.0204 

-0.7731 15.441 

-  1.539115.3151 

-0.061  ±5.987« 

5.9875 

0.0102 

-0.279  1  5.981 

-0.599  1  5.9571 
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Table  5.5.  Regulator  Pole  Locations 


Model  Poles 

For  Damping 
Ratio  £  =  0.05 

For  Damping 
Ratio  £  =  0.1 

-1.5038 

- 

- 

-1.5038 

-1.5038 

-6.8838 

- 

- 

-6.8838 

-6.8838 

From  Table  (5.5)  it  can  be  seen  that  the  design  goal  for  the  regulator  is  to  keep  all  s-plane  complex  poles 
at  the  same  <b„,  i.e.,  at  the  same  distance  from  the  origin  as  in  the  open-loop  system.  The  closed-loop  system 
will  then  move  each  complex  pole  pair  on  a  circle  of  constant  (o„  to  a  point  where  the  damping  ratio  is 
increased  to  the  chosen  value. 

5.2.1.  Implementation 

The  regulators  were  implemented  using  a  digital  control  with  a  standard  predictor  type  observer.  [105]  A 
sampling  frequency  of  20  Hz  (sampling  time  of  50  ms)  was  chosen  so  that  the  highest  natural  frequency  of 
the  model  poles  was  less  than  one  half  the  sampling  frequency.  Using  this  sampling  frequency,  a  discrete 
time  model  of  the  cascaded  combination  of  the  continuous  eighth  order  model  driven  by  a  D/A  converter 
was  developed.  [104]  Thus  the  model  used  to  develop  the  controller  is  of  the  form 


*(*+!)  -Fx(k)  +  Gu(k) 


(34) 


y(k)  -  Cx(k)  +Du(k) 


(35) 


The  controller  equations  then  will  have  the  form 


X(k  +  1)  =  (F-GKc-Kf)X(k)  +Kfy(k) 


(36) 


u(k)  =  -KcX(k) 


(37) 


To  implement  the  pole-placement  controllers,  the  desired  s-plane  closed-loop  pole  locations  listed  in 
Table  (5.5)  were  mapped  into  the  z-plane  using  the  mapping  z=esT  where  the  values  of  s  are  the  eight  pole 
locations  listed  in  the  appropriate  columns  of  Table  (5.6)  and  T  is  the  sampling  time.  The  desired  z-plane 
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locations  listed  in  Table  (5.6)  were  obtained  in  this  manner. 


Table  5.6.  Desired  z-Plane  Locations  for  Closed-loop  Poles 


For  Damping  Ratio  £  =  0.05 

For  Damping  Ratio  £  =  0.01 

s-Plane 

z-Plane 

s- Plane 

z-Plane 

-  1.005  ±  20.08  i 

0.5106  ±  0.80231 

-2.011  ±20.0001 

0.4886  ±0.76101 

-  0.773  ±  15.44/ 

0.6894  ±  0.67 1 1  i 

-  1 .539  ±  15.3151 

0.6675  ±0.64171 

-  0.279  ±  5.98i 

0.9423  ±0.2905i 

-  0.599  ±  5.9571 

0.9278  ±0.28481 

-1.5038 

0.9276 

-1.5038 

0.9276 

-6.8838 

0.7088 

-6.8838 

0.7088 

Using  the  desired  z-plane  locations  and  the  discrete  time  model  for  T=  0.05  seconds  given  Eqs.  (34)  and 
(35),  the  feedback  gains  for  the  two  regulators  were  calculated  using  the  Kautsky  algorithm.  [103,104]  The 
results  appear  in  Table  (5.7). 


Table  5.7.  Regulator  Feedback  Matrices 


Ka  (for  C  =  0.05) 

4.644 

-3.508 

-1.181 

-10.256 

5.979 

-6.680 

1.595 

-2.921 

9.053 

41.255 

44.088 

-.4.732 

1.484 

5.852 

-10.797 

-29.826 

Ltd  (for  £  =  0.05) 

-10.750 

6.591 

-1.755 

2.441 

14.907 

-52.969 

-2.432 

3.900 

-20.034 

6.757 

-4.734 

-12.069 

8.974 

0.090 

4.512 

0.835 

Kd  (for  C  =  0.1) 

3.682 

-1.499 

-5.476 

-31.487 

10.770 

-3.040 

11.898 

-26.038 

0.077 

-10.057 

-0.948 

1.122 

12.412 

15.811 

-4.810 

-17.980 

Ltd  (for  £  =  0.1) 

-10.759 

6.591 

-1.755 

2.441 

14.907 

-0.530 

-2.432 

3.900 

-20.034 

6.757 

-4.734 

-12.068 

8.974 

0.090 

4.512 

0.834 

K0  (for  LQG/ 
LTR) 

12.328 

15.120 

6.256 

3.580 

9.530 

-0.118 

-0.547 

-12.674 

24.199 

8.033 

-1.413 

0.167 

13.777 

-6.314 

-3.118 

-43.107 

LT0  (for  LQG/ 
LTR) 

0.152 

0.050 

-0.044 

-0.699 

0.383 

-0.161 

-0.288 

0.152 

-0.771 

0.020 

-0.371 

-0.770 

0.198 

1.008 

-0.130 

-0.529 

Two  requirements  were  placed  on  the  observer  poles.  First  the  real  parts  were  chosen  so  that  the  time 
constants  of  the  observer  are  short  compared  to  those  of  the  closed-loop  regulator.  Second,  the  observer  Eq. 


70 


(36)  must  be  open-loop  stable.  A  choice  of  eight  s-plane  poles  uniformly  spaced  on  the  interval  [-20.  -40] 
and  then  mapped  into  the  z-plane  using  the  mapping  z=esT  satisfied  these  requirements  for  both  regulators. 
Therefore,  the  same  observer  was  used  for  both  regulators.  Table  (5.8)  contains  the  s-plane  observer  poles 
and  the  corresponding  z-plane  locations  for  T=  0.05  seconds. 


Table  5.8.  Observer  Poles 


s-plane 

z-plane 

-20.0000 

0.3679 

-22.8571 

0.3189 

-25.7143 

0.2765 

-28.5714 

0.2397 

-31.4286 

0.2077 

-34.2857 

0.1801 

-37.1429 

0.1561 

-40.0000 

0.1353 

Because  the  calculation  of  the  observer  gain,  Kf,  is  a  dual  to  the  calculation  of  the  regulator  gain,  IQ,  the 
same  algorithm  and  software  may  be  used  for  calculating  Kf.  [106]  The  observer  gains  for  the  two  regulators 
are  also  listed  in  Table  (5.7). 

The  feedback  and  observer  gains  used  for  the  LQG/LTR  are  also  included  in  Table  (5.7). 

The  controllers  were  implemented  using  Quick  Basic  programming.  To  verify  that  the  controllers  were 
correctly  implemented,  a  step  response  was  calculated  using  Eq.  (36).  The  step  response  of  each  imple¬ 
mented  controller  was  measured  and  compared  to  the  designed  controller  step  response.  The  step  response  of 
the  eigenvalue  assignment  controller  for  £  =  0.05  and  the  step  response  for  the  LQG/LTR  controller  appears 
in  Fig.  (5.29).  In  each  instance,  the  measured  response  was  identical  to  the  response  predicted  by  simulation. 
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Figure  5.24  Controller  Step  Responses 


5.2.2.  Results 

Figure  5.25  shows  the  open-loop  time  response  of  the  test  article  when  initially  excited  in  the  first  mode. 
The  natural  damping  is  approximately  0.3%.  Closing  the  loop  with  the  Robust  Regulator  designed  in  Section 
5.2  with  £  =  0.05  yielded  the  results  shown  in  Fig.  5.31.  The  envelope  of  the  open-loop  response  is  also 
shown  for  comparison.  The  controller  performed  as  was  predicted  in  simulation.  The  effect  of  the  dead-band 
can  be  seen  when  the  control  output  no  longer  drives  the  wires.  At  this  point,  the  oscillations  decay  at  the 
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natural  damping  rate. 


Time  (s) 


0.4 

0.2 

0.0 

-0.2 

-0.4 


Figure  5.25  Open-Loop  Time  Response 
Sensor  1 


To  test  the  “robustness”  of  the  controller,  an  additional  2.25#  was  added  to  the  bottom  mass  and  1.25#  to 
the  middle  mass.  The  new  open-loop  time  response  is  plotted  in  Fig.  5.27.  The  additional  mass  resulted  in 
decreased  damping  and  an  increased  modal  frequencies.  As  can  be  seen  in  Fig.  5.28,  the  performance  of  the 
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be  insensitive  to  parameter  uncertainty,  linearized  model  error  and  sensor  noise.  To  overcome  these  difficul¬ 
ties,  a  robust  controller  was  designed  using  the  linear  quadratic  Gaussian  loop  transfer  recovery  (LQG/LTR) 
methodology.  The  block  diagram  of  the  controller  and  plant  is  shown  in  Fig.  5.29. 


Figure  5.29  Controller  and  Plant  Block  Diagram 
5.3.1.  Nominal  LQG/LTR  Design  Method 

The  fundamental  equations  needed  in  the  LQG/LTR  design  are  given  in  Eqs.  (38  -  43).  These  equations 
are  developed  in  Doyle  [106],  Maciejowski  [109],  and  Ridgely  [110]. 

j(0  =  Ax(t)  +By(t)  +  r»y(f)  (38) 


J  =  lim£{J„  (sT(t)Qcf(t)  +ltT{t)Rcy(t))dt} 


(39) 


o  =  a  pf + PfA T + rrT  -  i  pfTcp, 


(40) 


0  =  AtPc  +  PcA-  PcBR-c'BtPc  +  Qc 
Kc  =  R~'BTPe 


(41) 
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where 


Qc  -  CTC+ q2CTC  (42) 

The  basic  design  methodology  is  to  determine  Kf  so  that  you  obtain  a  certain  open-loop  transfer  function 
shape  and  then  to  determine  IQ  so  that  you  recover  the  robustness  stability  properties  desired.  The  loop  gain 
is  varied  using  p.  The  loop  shape  is  changed  using  r.  Finally,  the  amount  of  stability  recovered  is  deter¬ 
mined  by  the  parameter  q1.  After  K,  and  Kc  are  obtained,  the  controller  is  then  implemented  using  the  state 
equations  in  Eq.  (43). 


Y,  =  (A  -  BKC  -  K,C)  Xc  +  Kf 
Uc  =  -KCXC 


(43) 


An  initial  controller  design  was  performed  without  restricting  the  control  effort.  The  parameters  for  this 
design  were  p  =  0.2,  T  =  B  and  q2  =  lxlO3.  Figure  (5.30)  shows  the  amount  of  recovery  achieved  with  this 
controller.  Notice  that  we  do  not  have  perfect  recovery  around  the  frequency  of  interest  (3.6  rad/sec), 
although  from  Fig  (5.31),  we  see  that  we  are  already  saturating  the  actuators  (control  effort  >  1.2V).  The 
saturation  problem  is  caused  by  the  q2  parameter.  As  q2  is  increased,  the  a  accuracy  of  the  loop  transfer 
recovery  increases  (i.e.  controller  performance);  however,  the  magnitude  of  the  control  force  u(t)  also 
increases.  Because  of  limited  control  effort  wk&i  using  NiTiNOL  wire,  the  saturation  effect  will  always  be  a 
major  system  constraint. 

In  order  to  initially  overcome  this  constraint,  we  decided  to  reduce  the  amount  of  recovery  until  we 
acquired  a  controller  that  would  no  longer  saturate  the  actuators.  The  parameters  for  this  design  were 
p  =  0.2,  T  =  B  and  q2  -  50.  The  maximum  control  effort  produced  for  this  controller  was  0.9V.  However, 
Fig  (5.30)  shows  that  the  amount  of  recovery  for  this  design  was  extremely  low  which  translates  into  poor 
system  performance.  Therefore,  we  needed  to  develop  a  controller  that  would  not  only  not  saturate  our  actu¬ 
ators  but  would  also  be  insensitive  to  system  uncertainties.  We  turned  to  a  modified  structure  for  the  LQG/ 
LTR  controller  developed  by  Prakash  [111]  which  quickens  the  loop  transfer  recovery  process. 
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Figure  5.31  Comparison  of  Control  Effort 


5.3.2.  Modified  LQG/LTR  Method 

Before  we  show  the  results  obtained  using  the  modified  LQG/LTR  structure,  we  will  first  motivate  the 
physical  insight  behind  the  structure.  In  order  to  develop  the  insight  behind  the  modified  structure,  we  will 
first  demonstrate  why  observer  based  LQG/LTR  methodologies  result  in  high  gain  controllers.  The  first  main 
objective  behind  LQG/LTR  controllers  is  given  a  system  configuration  shown  in  Fig.  5.32  design  a  controller 
K(s)  such  that  P(jw)K(jw)  is  approximately  equal  to  L(jw)  which  is  the  desired  open-loop  transfer  function. 
Therefore,  the  error  transfer  function  between  the  desired  open-loop  transfer  function  and  the  actual  open- 
loop  transfer  function  is  E^s)  =  L(s)  -  PfsjK^s)  where  K^s)  is  the  transfer  function  of  an  observer  based 
controller.  From  Eq.  (43),  the  transfer  function  for  the  standard  LQG/LTR  structure  is. 

K0  =  K,  (O'1  +  Kf  +  BKr)  K f  (44) 


where  O  =  (si  -  A)'1 
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Figure  5.32  Basic  LQG/LTR  System  Configuration 


The  exact  LQG/LTR  design  procedure  depends  on  wether  the  closed-loop  system  is  broken  at  the  input 
or  output  of  the  plant.  We  will  concentrate  on  the  LQG/LTR  algorithm  if  the  loop  is  broken  at  the  output  of 
the  plant.  For  a  system  with  the  loop  broken  at  the  output  of  the  plant,  the  desired  open-loop  transfer  func¬ 
tion  becomes. 


L(s)  =  CQKf 


(45) 


Given  the  plant  transfer  function,  the  desired  open-loop  transfer  function,  and  the  controller  transfer 
function,  Prakash  showed  that  the  error  transfer  function  becomes. 

E0(s)  *  ( l  +  CQ(s)Kf)(!  +  N(s))-'N(s )  (46) 


where  N(s)  =  C (<*>-'  +BKc)-'Kf 


The  desired  goal  of  the  LQG/LTR  design  is  for  the  error  transfer  function  to  equal  zero  for  all  frequen¬ 
cies;  however,  Prakash  showed  that  this  can  only  be  true  if  N(s)  =  0  for  all  frequencies.  Since  this  cannot  be 
achieved,  the  size  of  N(jw)  must  be  small  in  some  sense  for  all  w  of  interest.  In  order  to  achieve  this,  let  K, 
be  parameterized  in  terms  of  a  parameter  a  such  that,  N(s)  =  C(0*‘  +  8Afc(o))‘,tf/-»0  pointwise  in  s  as 
o— » 0.  However,  we  want  the  state  feedback  and  observer  gains  to  be  independent  of  one  another;  therefore 
this  requires  that  C(<&''  +  ££,(0))''  -*0  pointwise  in  s  as  a  ->  0.  To  satisfy  this  requirement,  the  observer 
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based  LQG/LTR  approaches  require  that  }|  (<J)  |j  as  o  -» ».  Thus  the  standard  LQG/LTR  algorithm  is  a 

high  gain  approach.  Since  there  is  a  trade-off  between  robustness  properties  and  the  size  of  the  feedback  gain 
and  the  size  of  the  feedback  gain  is  a  major  system  constraint,  we  need  to  use  an  algorithm  which  provides 
as  low  a  gain  as  possible. 

By  allowing  ourselves  to  design  a  LQG/LTR  controller  that  is  not  observer  based  the  link  from  the  esti¬ 
mated  states  to  the  observer  can  be  eliminated  because  the  effect  of  the  above  link  on  the  output  of  the 
observer  based  controller  vanishes  asymptotically  as  The  structure  for  this  new  LQG/LTR  controller 

is  given  in  Fig  (5)  with  the  appropriate  link  missing  from  the  standard  LQG/LTR  structure. 


Figure  5.33  Block  Diagram  of  Modified  LQG/LTR  Structure 

The  feedback  gain  and  estimator  gain  vectors  are  determined  exactly  as  in  the  standard  LQG/LTR 
approach,  the  only  difference  is  the  structure  of  the  controller  after  the  gains  are  determined.  The  state  equa¬ 
tions  for  the  modified  LQG/LTR  structure  are 

X=(A-BKe)Xc  +  K,e  (47) 

U  =  -KCXC 


Since  this  controller  is  no  longer  observer  based  then  the  separation  principle  is  not  valid.  Prakash  how¬ 
ever  has  proven  that  the  design  objectives  of  the  closed-loop  stability  and  recovering  the  target  loop  shape 
can  both  be  simultaneously  achieved. 

Finally  to  demonstrate  that  the  new  structure  does  quicken  the  loop  transfer  recovery  process,  we  need  to 
return  to  the  error  transfer  function.  Prakash  showed  that  the  error  transfer  function  for  the  modified  struc¬ 
ture  is 


E0  =  N(s) 


(48) 


Also  assuming  that  the  singular  values  of  N(s)  are  small  over  a  frequency  range  of  interest  and  that  the 
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singular  values  of  CQKf  are  large  for  the  frequency  range  of  interest  (i.e.  good  controller  performance),  then 
comparing  Eqs.  (46)  and  (48)  shows  that  as  q2  is  increased,  Eq.  (48)  becomes  small  more  quickly  than  Eq. 
(46)  [111]. 

5.3  J.  Results 

Using  this  structure,  we  designed  a  new  controller  with  p  =  0.2,  r  =  B,  q}  =  100.  Figure  5.30  shows  a 
comparison  of  the  recovery  between  the  two  structures.  Notice  that  the  modified  structure  has  a  much  greater 
recovery  in  the  3.6  rad/sec  region.  Figure  5.31  shows  that  the  control  effort  for  the  modified  structure  is  less 
than  1 .2 V.  Therefore,  the  modified  structure  does  not  saturate  the  actuators. 

The  robustness  of  the  controller  was  tested  by  allowing  the  tip  mass  to  vary  from  6.5  lbs  to  9.0  lbs.  Fig¬ 
ure  5.34  shows  the  system  performance  for  both  the  initial  tip  mass  and  the  increased  tip  mass.  Notice  that 
the  system  damping  is  insensitive  to  the  variation  in  the  tip  mass. 


Figure  5.34  System  Performance  with  the  Modified  Structure  LQG/LTR  Controller 

After  the  modified  LQG/LTR  algorithm  was  applied  to  the  SISO  system,  the  algorithm  was  then  applied 
to  the  three  mass  system  (MIMO  system).  Again  an  initial  comparison  was  performed  between  the  standard 
LQG/LTR  controller  and  the  modified  LQG/LTR  controller.  Figure  5.35  shows  that  the  modified  structure 
again  produced  a  controller  with  much  less  control  effort. 
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Figure  5.36  Initial  Condition  Responses  for  Three  Mass  System 


The  first  robustness  test  was  the  addition  of  sensor  noise.  The  sensor  noise  was  a  normally  distributed 
noise  signal  with  a  variance  of  400  and  a  mean  of  zero.  Figure  5.37  contains  the  noise  corrupted  sensor  val¬ 
ues  and  the  actual  system  response  for  both  of  the  controllers.  Even  though  the  signal  to  noise  ratio  is  poor, 
the  modified  LQG/LTR  controller  damped  the  oscillations  to  the  dead  band  level  in  approximately  10  sec¬ 
onds,  while  the  standard  LQG/LTR  controller  was  unstable. 

Next,  a  parameter  variation  robustness  test  was  conducted  by  adding  mass  to  the  test  article.  The  second 
and  third  masses  were  increased  by  1.4  lbs.  Figure  5.38  contains  the  experimental  system  responses.  With 
this  parameter  variation,  both  controllers  were  able  to  damped  the  oscillations  to  the  dead  band  level  in 
approximately  15  seconds. 
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Figure  5.38  Controller  responses  with  Added  Mass  for  the  Three  Mass  Structure 


5.4.  Adaptive  Controllers 

Let  a  system  be  described  by 


x=f(x,u,  p,  0,  f) 
y  =  h(x,p,  0,  t) 


(49) 


where  x  is  the  system  state,  u  is  the  system  input,  p  and  0  together  make  up  the  system  parameters.  The  ele¬ 
ments  of  the  vector  p  are  those  parameters  which  are  either  unknown  and  constant  or  vary  with  time  in  an 
independent  fashion  and  the  elements  of  0  represent  the  parameters  which  are  under  the  control  of  the 
designer,  t  is  the  time  index.  Adaptation  then  corresponds  to  the  adjustment  of  the  control  parameter  vector  0 
to  compensate  for  the  unknown  parameter  p. 

If  the  functions  /  and  h  are  linear  mappings,  the  system  can  be  represented  by 
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X  =  A  (p,  0,  t)  x  +  B  (p,  0,  f)  u 
>  =  H(p,  0,  t)x 


(50) 


where  A,  B  and  H  are  matrices  of  bounded,  piecewise-continuous,  time  functions.  Furthermore,  if  the  system 
ii  Mme-invariant,  it  can  be  expressed  in  the  form 


i  =  A(p,  0)x  +  B  (p,  9)u 
y  =  H  (p,  0)  x 


(51) 


In  the  class  of  systems  governed  by  Eq.  (51),  the  differential  equations  governing  the  plant  are  linear  and 
the  parameters  are  unknown  but  constant.  The  goal  of  the  adaptive  system  is  to  find  a  constant  parameter 
vector,  6*  using  the  measured  signals  of  the  system  such  that  lim0  (r)  =0*  and  the  system  is  modeled  by 


x  =  A  (p,  0*.  r)  x  +  B  (p,  0’,  0  u 
y  *  H  (p,6',t)x 


(52) 


behaves  in  the  desired  fashion.  Since  6  is  adjusted  dynamically  as  a  function  of  the  states,  the  overall  system 
becomes  nonlinear.  Equation  can  then  be  rewritten  as 


x  =  A  (p,  0,  r)  x  +  B  (p,  0, t)  u 
9  =  g(x,  0, 0 


(53) 


As  time,  t,  goes  to  infinity,  the  behavior  of  the  nonlinear  system  in  Eq.  (53)  should  approach  that  of  the 
linear  system  in  Eq.  (52).  It  is  this  special  class  of  nonlinear  systems  which  shall  be  referred  to  as  adaptive 
systems  throughout  this  thesis. 

5.4.1.  Direct  vs.  Indirect  Adaptive  Control 

For  many  years,  there  have  basically  been  two  distinct  methods  for  finding  the  solution  of  the  adaptive 
control  problem.  These  are  direct  and  indirect  control.  When  the  controller  parameters,  Q(k),  are  directly 
adjusted  to  reduce  some  norm  of  the  output  error  between  the  reference  model  and  the  plant,  this  is  called 
direct  control  or  implicit  identification.  In  indirect  control,  also  referred  to  as  explicit  identification,  the 
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parameters  of  the  plant  are  estimated  as  the  elements  of  a  vector  p(k)  at  each  instant  k  and  the  parameter 
vector  9(k)  of  the  controller  is  chosen  assuming  that  p(k)  represents  the  true  value  of  the  plant  parameter 
vector,  p.  Figure  5.39  and  Fig.  5.40  respectively  show  the  direct  and  indirect  model-reference  adaptive  con¬ 
trol  structures  for  a  Linear  Time  Invariant  (LTI)  plant.  It  is  important  to  ni_ie  that  in  both  cases  efforts  have 
to  be  made  to  probe  the  system  to  determine  its  behavior  as  control  action  is  being  taken  based  on  the  most 
recent  information  available.  The  input  to  the  process  is  therefore  used  simultaneously  for  both  identification 
and  control  purposes.  However,  not  every  estimation  scheme  followed  by  a  suitable  control  action  will  result 
in  optimal  or  even  stable  behavior  of  the  overall  system.  Therefore,  considerable  care  must  be  taken  in 
blending  estimation  and  control  schemes  to  achieve  the  desired  objective  [107], 


Figure  5.39  Direct  Model-Reference  Adaptive  Control  Structure 


Figure  5.40  Indirect  Model-Reference  Adaptive  Control  Structure 


5.4.2.  Model* Reference  Adaptive  Systems 

The  Model-Reference  Adaptive  System  (MRAS)  used  previously  to  demonstrate  the  difference  between 
direct  and  indirect  control  is  one  of  the  main  adaptive  control  techniques.  Model-Reference  Adaptive  Control 
(MR AC)  was  originally  proposed  to  solve  a  problem  in  which  the  specifications  are  given  in  terms  of  a  ref¬ 
erence  model  that  describes  how  the  process  output  ideally  should  respond  to  the  command  signal.  This  is 
called  the  model-following  problem.  The  model-following  problem  can  be  solved  using  pole-placement  con¬ 
troller  design  when  the  plant  parameters  are  completely  specified  with  some  given  degree  of  certainty.  How¬ 
ever.  when  parts  of  the  plant  are  unknown  or  contain  a  high  degree  of  uncertainty,  adaptive  techniques  can 
be  used.  Looking  back  at  Fig.  5.39,  a  process  is  controlled  via  a  regulator  with  input/output  feedback  loops. 
The  desired  performance  of  the  plant  is  described  as  the  output  of  a  linear,  or  nonlinear,  reference  model,  ym, 
which  gives  the  desired  response  to  a  command  signal,  r.  The  error,  ea  is  then  used  by  the  outer  loop  to 
adjust  the  controller  parameter  vector,  0.  In  Fig.  5.40,  the  process  is  controlled  in  the  same  manner.  Again, 
the  desired  performance  is  specified  by  the  pair  { r ,  ym}.  The  plant  parameter  vector,  p(k)  ,  is  estimated  and 
used  with  the  error,  ev  by  the  outer  loop  to  adjust  the  controller  parameter  vector,  0.  In  both  cases  the  main 
problem  is  to  determine  an  adjustment  mechanism  such  that  a  stable  system  which  drives  the  output  error  to 
zero  is  obtained.  Note  that  the  feedback  control  loop,  consisting  of  the  process  and  the  regulator,  is  assumed 
to  be  faster  than  the  outer  loop,  which  adjusts  the  regulator  parameters. 

Another  class  of  adaptive  system  is  the  Self -Tuning  Regulator  (STR)  mentioned  in  the  introduction.  The 
STR  is  based  on  the  ideas  of  separating  the  estimation  of  the  unknown  plant  parameters  from  the  design  of 
the  controller.  The  basic  STR  consists  of  a  parameter  estimator,  a  linear  controller  and  a  block  which  deter¬ 
mines  the  control  parameters  from  the  estimated  parameters.  See  Fig.  5.41.  The  unknown  plant  parameters 
are  estimated  on-line  using  a  recursive  estimation  method.  The  estimated  parameters  are  used  in  the  control 
design  equation  as  if  they  are  true,  i.e.,  the  uncertainties  of  the  estimates  are  not  considered.  Any  control 
design  method  can  be  used,  e.g.,  minimum  variance,  Linear  Quadratic  Gaussian/Loop  Transfer  Recovery 
(LQG/LTR),  pole-placement,  model-following,  etc.  The  design  method  is  chosen  depending  on  the  specifica¬ 
tions  of  the  closed-loop  system.  The  STR  was  originally  developed  for  sampled-data  systems,  but  has  since 
been  extended  to  continuous-time.  Hybrid  algorithms  have  also  been  developed  [108],  STRs  have  been  stud¬ 
ied  extensively  and  enjoy  a  wide  popularity  due  to  their  flexibility.  They  are  easily  understood  and  imple¬ 
mented  [108].  STRs  can  be  implemented  as  direct  or  indirect  controls  similar  to  the  MRAS.  In  many  cases, 
there  is  a  direct  correspondence  between  STRs  and  MRAS. 
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Figure  5.41  Self-Tuning  Regulator  Structure 


5.4J.  Conventional  Direct  MRAC 

The  first  controller  implemented  on  the  structure  was  the  Direct  MRAC  shown  in  Fig.  5.42.  This  gives  a 
basis  for  comparison  between  direct  and  indirect  control.  The  Direct  MRAC  was  implemented  using  the 
algorithm  presented  in  Section  II.  (See  Listing  C.I  in  See  Appendix  C.)  Fig.  5.43  shows  a  plot  of  the  open- 
loop  response  envelope,  the  desired  response  envelope,  and  the  closed-loop  response  achieved.  As  can  be 
seen,  closed-loop  system  adapts  to  the  reference-model  response  until  deadband  is  reached  (approximately 
11  seconds),  at  which  point  adaptation  is  turned  off.  In  Fig.  5.44,  the  control  parameter  vector  8  (A)  has  sta¬ 
bilized  after  about  8  seconds  and  before  deadband  is  reached. 
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Time  (kT) 

Figure  5.44  Evolution  of  Control  Vector,  0(*)  (Direct  MRAC) 
The  final  values  of  the  controller  parameters  are  given  below 


0  (*) 

Final  Value 

e,(*) 

[-0.78069  0.75716] 

e0(*) 

8.92846 

e,(*) 

[l. 18814  -0.16468] 

1.  Conventional  Indirect  MRAC 

Next,  an  Indirect  MRAC  was  implemented  on  the  structure  as  shown  in  Fig.  (5.42).  Like  the  Direct 
MRAC,  the  Indirect  MRAC  was  implemented  using  the  algorithm  presented  in  Section  II.  (SeeListing  C.2  in 
See  Appendix  C.)  Figure  (5.46)  shows  a  plot  of  the  open-loop  response  envelope,  the  desired  response  enve¬ 
lope,  and  the  closed-loop  response  and  Fig.  (5.47)  shows  the  time  evolution  of  the  control  parameter  vector 
9(Jk)  .  Again,  the  parameters  stables  after  about  8  seconds  with  deadband  reached  by  11  seconds. 
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The  final  values  of  the  controller  parameters  are  given  below 


ew 

Final  Value 

e,  (*) 

[0.98276  -1.0117(3 

e0<*) 

9.36202 

®2(*) 

[l .6 1385  -0.14307] 
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6.  Identification  and  Control  Using  Neural  Networks 


The  linear  and  non-linear  mapping  properties  of  neural  networks  have  been  extensively  utilized  in  sys¬ 
tem  identification  [112-117],  Many  control  strategies  using  neural  networks  have  been  suggested  where  both 
system  identification  as  well  as  control  is  accomplished  using  multilayered  networks.  Most  of  the  control 
strategies  assume  very  little  or  no  apriori  knowledge  about  the  system  and  therefore  use  the  property  of 
adaptation  of  the  neural  networks.  Networks  used  for  identification  in  these  schemes  do  not  provide  any 
knowledge  about  the  parameters  of  the  mathematical  model  of  the  system.  In  this  section,  we  have  used  a 
neural  network  to  determine  a  linear  model  of  a  system  in  two  different  ways.  First,  we  have  used  multilay¬ 
ered  neural  networks  for  determining  the  Markov  parameters  of  a  dynamical  system  from  experimentally 
determined  input-output  sequences.  A  mathematical  model  of  the  structural  system  is  then  determined  from 
these  Markov  parameters  using  the  ERA.  Second,  we  have  developed  a  method  of  determining  a  linear  state 
space  model  of  a  system  from  experimental  test  data  using  neural  networks.  In  this  method,  a  sequence  of 
state  vectors  are  calculated  from  the  input-output  data  of  the  system.  These  state  vectors  along  with  the 
input-output  data  are  used  to  train  a  multilayered  network.  The  product  of  the  weights  of  the  trained  network 
gives  the  state  space  model  of  the  system.  The  neural  network  architectures  for  system  identification  utilize 
the  standard  backpropogation  learning  algorithm  [112]  for  training  and  are  often  large  in  size.  Therefore, 
such  networks  typically  require  very  long  training  times.  To  enhance  the  rate  at  which  such  networks  learn 
and  hence  to  reduce  the  learning  time,  we  have  developed  an  accelerated  adaptive  learning  rate  algorithm. 
This  algorithm  adjusts  the  learning  rate  used  in  the  standard  backpropogation  algorithm  at  every  epoch  so  as 
to  minimize  the  output  error  at  a  faster  rate. 

6.1.  Neural  Network  Technique  to  Generate  the  Markov  Parameters 

In  the  method  suggested  by  Bialasiewicz  et.  al  [116],  a  multilayered  feedforward  network  [see  Fig.  6.1] 
is  trained  using  the  input-output  data  of  the  system.  Once  the  network  is  trained,  the  required  Markov  param¬ 
eters  are  found  to  be  proportional  to  the  product  of  the  weighting  matrices  of  the  network. 

The  computational  simplicity  of  this  method  is  one  of  the  main  advantages  over  the  conventional 
observer  formulation  to  calculate  the  Markov  parameters  of  a  system.  In  addition,  the  noise  rejection  proper¬ 
ties  of  the  neural  networks  allow  more  accurate  estimation  of  the  Markov  parameters  from  noisy  experimen¬ 
tal  data  as  compared  to  the  observer  formulation. 
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Figure  6.1.  Neural  network  architecture  for  determining  the  Markov  parameters 


In  the  neural  network  shown  in  Figure  6. 1 .  the  activation  functions  used  by  each  of  the  hidden  layers  and 
the  output  layer  are  hyperbolic  tangent  and  a  linear  respectively.  The  hyperbolic  tangent  and  the  linear  acti¬ 
vation  functions  are  given  by 


Hyperbolic  Tangent  T (x)  =  Aftanh(^cu) 

Linear  T(r)  »  Kx 


(1) 


The  input  to  this  network  consists  of  only  the  past  inputs  of  the  plant  [see  Figure  6.1]  whose  Markov 
parameters  are  to  be  determined.  Let  the  vector  consisting  of  the  past  inputs  of  the  plant  be  denoted  by 


Z“  [«(*)] 


«(*) 

«(*-!) 


-»Wx  I 


(2) 


Let  the  number  of  neurons  in  the  first  hidden  layer,  second  hidden  layer  and  the  output  layer  are  P,  Q 
and  M  respectively,  then  the  input  to  the  first  hidden  layer  is 


v(k)  =  wl  xZ~‘  [a(Jfc)] 


(3) 


And  the  output  of  the  first  hidden  layer  is 


v(*)  =  T(wl  x  Z~'  [u  (k)  ] ) 


(4) 


Similarly,  the  input  and  output  vectors  of  the  second  hidden  layer  and  the  output  layer  can  be  written  as 
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Z  (k)  -  w2  x  v(k) 


(5) 


z(k)  =  r(w2xr(wixr'[u(k)])) 


(6) 


y(k)  =  w3xz(k) 


(7) 


y(k)  =  Ax  w3  x  r(w2  x  r(w»I  x  T'  [u  (k)  ] ) )  (8) 

where  w  1 ,  w2  and  w3  are  the  weighting  matrices  of  the  network  and  A  is  a  constant.  Equation  (8)  gives  the 
relation  between  the  output  and  the  input  of  the  network. 

Once  the  network  is  trained,  the  neurons  can  be  assumed  to  operate  in  the  linear  range  of  the  nonlinear 
activation  functions  and  will  behave  as  a  linear  network  within  a  certain  range  of  input  signals.  Then,  Eq.  (8) 
can  be  approximated  by 


y{k)  -  Ax  [vv3  x  h>2  x  wl]  x  [Z~l  [«(*)]  ]  (9) 

Now,  consider  an  nth  order  linear  discrete  time  system  described  by 

*<*+l>  =**(*) +p«(*) 

y(k)  =  CX(k) 

The  system  in  Eq.  (10)  can  also  be  described  in  terms  of  its  Markov  parameters  as  follows 

y(k)  *  [c<l>0P  C<&‘P  ...  xZ"'  [«(*)]  (11) 

where  [C<t>°P  C<X»‘P  C<I>Jp  ...  are  the  Markov  parameters  of  the  system. 

From  Eq.  (9)  and  Eq.  (11)  it  can  be  seen  that  for  a  proper  choice  of  the  number  of  past  inputs  ( N )  of  the 
plant,  the  product  of  the  weighting  matrices  of  the  network  (Eq.  (9»  is  proportional  to  the  Markov  parame¬ 
ters  of  the  system  (Eq.  (10)). 

Ax  [w3x  w2xw\)  ~  [cO°p  C4>'P  C<D2P  ...  C^*1^]  (12) 

The  Eigensystem  Realization  Algorithm  can  then  be  utilized  to  obtain  a  reduced  order  state  space  model 


% 


of  the  system  from  these  Markov  parameters. 


6 .2.  Identification  of  State  Space  Model  of  a  system  using  Neural  Networks 

Most  neural  network  identification  techniques  as  described  in  section  6.1  generate  a  neuro-model  of  a 
system  such  that  the  network  response  is  similar  to  that  of  the  system  response  for  a  given  input.  These 
neuro-models  of  systems  cannot  be  used  to  design  modem  full  state  feedback  controllers  since  no  informa¬ 
tion  about  the  parameters  of  the  linear  system  are  available.  In  1991  Chow  and  Yam  [118]  suggested  a 
method  to  identify  the  discrete  time  poles  and  zeros  of  a  system  from  its  input  output  data.  This  method 
assumes  that  the  values  of  the  transfer  function  coefficients  were  known  within  a  certain  range  for  training 
the  neural  network.  This  assumption  seriously  limits  the  applicability  of  this  method  to  real  world  problems 
because  there  is  seldom  any  information  available  about  the  model  of  a  system. 

In  this  section,  an  identification  method  based  on  feedforward  neural  networks  to  estimate  a  linear  model 
of  a  system  has  been  presented.  In  this  method,  the  product  of  the  weights  of  a  trained  feedforward  network 
gives  the  state  space  matrices  of  a  system. 

Consider  a  system  described  by  the  following  state  space  representation 

x(k  +  1)  =Ax(k)  +Bu(k) 

y  (k)  —  Cx(k)  +Du(k)  (13) 


where  x(k)  is  a  n  x  1  dimensional  state  vector  u  (k)  and  y(k)  axe  m  and  q  dimensional  input  and  output 
vectors  respectively.  They  are  described  by  the  following  equation 


x(k) 


~xx  ( ky 

*2  (*) 


-ux  ( k )- 

u(k)  = 

... 

y(k)  = 

... 

.v*>. 

(14) 


Since  a  network  with  two  hidden  layers  is  shown  to  better  approximate  any  mapping,  a  network  as 
shown  is  used  for  the  purpose  of  identification. 
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Figure  6.2.  Architecture  for  Linear  System  Identification 


Since  this  network  tries  to  estimate  models  of  linear  systems,  the  activation  functions  used  in  all  the  lay¬ 
ers  are  linear  functions  as  given  by  the  following  equation 

r  (jc)  =  Kx  (i5) 


where  AT  is  a  constant  defining  the  slope  of  the  linear  activation  function.  In  addition,  the  network  does  not 
have  any  bias  inputs.  Let  wl,  w2  and  w3  be  the  weighting  matrices  of  the  network  shown  in  Fig.  6.2.  Let 
v,  z  and  ynn  be  the  input  vectors  of  the  first  hidden  layer,  second  hidden  layer  and  the  output  layer  respec¬ 
tively.  Let  v,  z  and  ynn  be  the  outputs  of  the  first  hidden  layer,  second  hidden  layer  and  the  output  layer 
respectively.  Then  the  forward  equations  of  the  network  in  Fi*  6.2  are  as  follows. 


Define 


'*1  (*)  ■ 

'*i  (*+  1)' 

x2(k) 

x2(k+  1) 

xn(k ) 

YnnW  = 

xn(k+  1) 

a,  ( k ) 

yi(k) 

.  M*)  . 

(16) 
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Then  the  input  and  the  output  vector  of  the  first  hidden  layer  is  given  as 

v  =  wl  x  £/ 

nn 

v  =  T(v)  =  T(wl x  Unn) 

Similarly  the  input  and  output  of  the  second  hidden  layer  and  the  output  layer  are 

Z  =  w 2  x  v 

=  w2  x  T (wl  x  £/  ) 
nn' 

z  =  r  (z)  =r(w2xr(wixf/nn)) 


07) 


(18) 


Ynn  =  w3  x  z 

=  w3xr(w2xT(wl  xi/J)  (l9) 

Ynn  -  r (w3  x  r  (w2  x  T(wl  x  Unn) ) ) 

Equation  (19)  gives  the  relation  between  the  input  and  the  output  of  the  network.  Since  the  activation 
functions  used  in  all  the  layers  are  linear,  Eq.  (19)  reduces  to 

Ynn  =  [w3  X  w2  x  wl]  x  Unn  (20) 

Defining 

w321  =  w3xw2xwl  (21) 


and  substituting  for  U nn  and  Ynn  in  Eq.  (20 


x{  ( k+  1) 
x2 (k +  1) 

*„(*+!) 

y{(k) 

.  yqw 


(22) 
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Now,  the  state  space  representation  of  the  system  in  Eq.  (13)  can  be  rewritten  as 


■xx  ( k  +  1)- 

-X, (k)  - 

Jt2  (k  +  1) 

x2(k) 

xn{k+l) 

=[c  a 

Uj  (k) 

.  v*> 

_%<*)_ 

Comparing  Eq.  (22)  and  (23) 


w321  = 


[vv3  x  w2  x  w 


B- 

D. 


(24) 


From  Eq.  (24)  it  can  be  seen  that  when  a  network  is  trained  with  the  state  vector,  the  input  vector  and  the 
output  vector  of  a  system  as  shown  in  Fig.  6.2,  the  product  of  the  weighting  matrices  is  the  augmented  state 
space  matrix  of  that  system. 

For  this  identification  method,  the  input  output  record  can  be  collected  experimentally  and  the  state  vec¬ 
tors  of  the  system  can  be  calculated  off  line  from  the  input  output  sequence  using  an  algorithm  suggested  by 
Moonen  et.  al  (3]  as  outlined  below. 

The  sequence  of  experimentally  determined  input  output  sequence  is  used  to  form  two  block  Hankel 
matrices  as  shown 


where 


H,  = 

H,  = 

r 

<N 

1 _ 

.ukl. 

2 

y{k) 

y(k+ 1) 

y(k+j- 1)  - 

y(k+ 1) 

y(k  +  2) 

• 

y(k+j ) 

(k  + 1  —  1) 

y(k  +  i) 

y  (k  +  i+j  -2)  _ 

(25) 


(26) 
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Yh2  ~ 


v  (k  +  i)  y  (k  +  i  +  1) 

y(k  +  i  +  1)  y(k  +  i  +  2) 

(k  +  2i-  1)  y  (k  +  2 i) 


y(k  +  i+j-  1) 
y(k  +  i+j) 

y(k  +  2i  +j  -  2) 


(7^1  and  C/^2  are  similarly  constructed.  Here,  i  and  j  are  constants  such  that  j  »  maximum  (mi,  qi) 


Then  let 


Then  the  singular  value  decomposition  (SVD)  of  H  gives 


^121  r5H  01  y7 

U 22  -  0  0_ 


where  that  dimensions  of  the  submatrices  in  Eq.  (29)  are 


S1(  =  (2 mi  +  n)  x  (2 mi  +  n) 
U{ j  =  (mi  +  qi)  x  (2 mi  +  n) 
C/j 2=  (mi  +  qi)  x  (2 qi-n) 
U21  =  (mi  +  qi)  x  (2mi  +  n) 
U2 2=  (mi  +  qi)  x  (2 qi-n) 


Now  define  the  state  vector  sequence 


X  =  [x  (&  +  t)  x(fc  +  t'+l) 


x(k  +  i  +j  -  1)] 


X  = 


where  U  is  obtained  from  the  singular  value  decomposition  of  the  product  of  the  matrices  ^{2^11^11  as 
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given  by  the  equation 


^11*11  -  [£'«, 


<1 


0 


,Tl 


VT1 
Lr«  - 


(33) 


6.3.  Accelerated  adaptive  learning  algorithm 

Feedforward  neural  networks  used  for  system  identification  [Fig.  6.4]  tend  to  be  very  large  in  size.  Since 
such  networks  often  use  the  standard  backpropogation  algorithm  for  training,  the  number  of  iterations  and 
hence  the  time  required  for  training  tends  to  be  very  large.  In  addition,  a  network  which  learns  using  the 
backpropogation  algorithm  with  a  constant  learning  rate,  tends  to  settle  in  a  local  minimum  if  an  appropriate 
learning  rate  is  not  chosen.  In  the  standard  backpropogation  algorithm  the  weights  of  the  network  are 
updated  using  the  recursive  formula 


W(k  +  1)  =  W(k) -n^W(k)  (34) 

where  W’s  are  the  weights  of  the  network,  q  is  the  learning  rate  and  e  is  the  error  between  the  output  of  the 
network  and  the  desired  output. 

The  rate  at  which  a  network  learns  using  the  standard  backpropogation  algorithm  can  be  enhanced  by 
changing  the  learning  rate  at  the  end  of  every  epoch  of  training.  The  basic  idea  behind  the  proposed  acceler¬ 
ated  learning  algorithm  is  to  increase  the  learning  rate  if  the  sum  of  the  squared  error  (SSE)  over  one  epoch 
is  less  than  the  previous  epoch  and  to  decrease  it  if  the  SSE  is  greater.  Thus  the  new  weights  updating  equa¬ 
tion  becomes 


W(k+\)  =  W(k) -i\(kk)$-W(k)  (35) 

ae 

where  q  (kk)  is  the  learning  rate  calculated  at  the  end  of  every  epoch  for  kk  -  1, 2, ... 

If  the  SSE  continues  to  drop  over  consecutive  learning  epochs,  the  rate  of  increase  of  q  (kk)  is  acceler¬ 
ated.  Similarly,  if  the  SSE  increases  over  consecutive  training  epochs,  the  rate  of  decrease  of  q  (kk)  is  accel¬ 
erated.  While  accelerating  the  increment  in  the  learning  rate,  care  must  be  taken  to  avoid  a  sudden  increase 
in  the  SSE  towards  the  end  of  the  learning  curve.  To  avoid  such  an  increase  in  the  SSE,  the  acceleration 
coefficient  used  is  smaller  when  the  SSE  is  very  small.  The  steps  involved  in  the  algorithm  and  the  different 
conditions  which  control  the  increase  or  decrease  of  the  learning  rate  q  is  outlined  in  the  flow  diagram  in 
Fig.  6.5. 
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(SSE,  SSE  last  epoch) 


If  SSE  <  0.01  If  SSE  >=0.01 

r)  (kk  +  1)  =  [1.08x  (1,003)"]  xr[(kk)  r\  (**  +  1)  =  [1.08  x  ( 1.03) "]  x  T|  (*) 

n  =  #  of  times  SSE  has  decreased  n  =  #  of  times  SSE  has  decreased 

•  SSE  <  0.01  implies  that  the  learning  is  in  its  final  .  jj,e  increment  in  the  learning  rate  is  accelerated 

stages,  therefore  the  acceleration  coefficient  is  reduced. 


Figure  6.3.  Accelerated  Adaptive  Learning  Algorithm 

The  constants  in  the  algorithm  shown  in  Fig.  6.5  are  empirically  found  to  work  for  most  problems, 
although  they  may  be  changed  slightly  to  best  suit  a  particular  problem.  As  a  general  rule  the  initial  value  of 
t)  (kk)  is  chosen  to  be  very  small.  The  algorithm  will  then  increase  or  decrease  the  learning  rate  depending 
upon  the  SSE  at  the  end  of  each  epoch. 

6.4.  Comparison  of  the  Identified  Models 

The  cantilever  beam  was  excited  for  60  seconds  with  a  random  input  uniformly  distributed  between  ±5 
volts,  and  its  response  sampled  every  0.2  seconds.  This  data  set  was  used  to  generate  a  model  using  the  neu¬ 
ral  network  -  ERA  method  and  the  neural  network  based  direct  identification  method.  In  the  neural  network- 
ERA  procedure,  the  architecture  of  the  network  used  was  a  120 x  I20  x  60x  1 .  The  activation  functions  used 
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were  the  hyperbolic  tangent  functions  given  by  Eq.  (1).  The  size  of  the  network  was  determined  by  increas¬ 
ing  the  number  of  neurons  in  the  input  and  the  first  h'dden  layer  until  the  output  error  of  the  trained  network 
was  on  the  order  of  10~J.  In  the  direct  state  variable  identification  method,  the  parameters  used  to  form  the 
block  Hankel  matrix  of  the  system  inputs  and  outputs  were  k  =  10,  j  =  900  and  i  =  100.  Since  the  sys¬ 
tem  is  a  single  input  single  output  (SISO)  system  m  =  1  and  q  =  1 .  The  singular  value  decomposition  of 
this  Hankel  matrix  resulted  into  the  size  of  the  matrix  Sj  j  in  Eq.  6.29  to  be  203.  Then  the  order  of  the  sys¬ 
tem  was  determined  to  be  equal  to  three  as  follows 

2  mi  +  n  =  203 

n  =  203  -  (2  x  1  x  100)  (36) 

n  =  3 

A  network  of  size  4x10x10x4  was  trained  using  the  appropriate  training  vectors.  Initially,  the  stan¬ 
dard  backpropogation  algorithm  was  used  to  train  the  network.  Due  to  the  size  of  the  network,  a  very  large 
number  of  iterations  were  required  for  satisfactory  training.  To  decrease  the  number  of  training  iterations, 
the  accelerated  adaptive  learning  rate  algorithm,  outlined  in  Fig.  6.3,  was  developed  and  implemented.  A 
comparison  of  the  performance  of  the  SSE  of  the  network  with  the  standard  adaptive  learning  rate  algorithm 
and  the  accelerated  adaptive  learning  rate  algorithm  is  shown  in  Fig.  6.4.  In  the  standard  adaptive  learning 
rate  algorithm,  the  learning  rate  is  increased  by  a  factor  and  the  weights  are  updated  at  the  end  of  every 
epoch  only  if  SSE  reduces.  Otherwise,  the  learning  rate  is  reduced  by  a  factor  and  the  update  of  the  weights 
are  set  to  zero.  The  superiority  of  the  accelerated  learning,  both  in  terms  of  learning  rate  and  the  squared 
error  performance  can  be  seen  from  Fig.  6.4. 


Figure  6.4.  Comparison  of  performance  of  various  learning  rate  algorithm 
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The  state  variable  model  of  the  beam  obtained  directly  from  the  neural  network  is  given  in  the  following 
equation 


r-o.5517 

-3.5755 

-3.43871 

ro.  12421 

0.1242 

-0.3079 

-1.8064 

Bnn2  - 

0.0653 

.  0.0653 

0.2523 

0.6215  . 

.0.0137. 

Cnn2  =  [0.0012  -0.0010  0.1300] 


(37) 


(38) 


The  equivalent  continuous  time  transfer  function  (at  T  =  0.2  sec)  is 
_  ,  x  0.003*2-  0.00625  +  2.03 13 

G nn2  i  2 

+  4.9072$  +  89.01  $  +  432.5375 


(39) 


In  comparison,  the  discrete  time  state  space  model  (T  =  0.2  sec)  of  the  cantilever  beam  generated  using 
the  neural  network  -  ERA  method  was 


d> 


nn 


-0.3067  0.9420  0.0316 
-0.9420  -0.2971  -0.1595 
-0.0316  -0.1595  0.7465 


0.0510 

-0.0154 

0.0459 


=  [-0.051  -0.0154  0.045?] 


(40) 


The  continuous  time  transfer  function  equivalent  of  Eq.  (40)  is 


Gm(s) 


-0.0006$2- 0.1069$  +  1.1652 
$ 3  +  1 .4554 j2  +  89.4893$  +  1 20. 1 7 1 3 


(41) 


The  comparison  of  the  frequency  response  of  the  two  models  obtained  using  the  neural  network  based 
methods  and  the  measured  frequency  response  of  the  beam  i*  shown  in  Fig.  6.5. 
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Figure  6.5  shows  that  there  is  a  acceptable  match  between  the  frequency  of  the  first  mode  of  the  two 
models  and  the  measured  frequency  response  of  the  beam. 

6.5.  Neural  network  Based  Robust  Controller 

As  a  first  step  towards  using  a  neural  network  as  a  controller  with  real-time  adaptation  capabilities,  the 
LQG/LTR  controller  with  the  modified  structure  was  copied  by  a  multilayered  feedforward  type  neural  net¬ 
work.  The  two  controllers  were  then  compared  for  the  closed  loop  performance  and  robustness  properties 
against  plant  uncertainties. 

The  network  setup  to  copy  the  LQG/LTR  controller  is  as  shown  in  Fig.  6.6. 


Figure  6.6.  Neural  network  training  scheme  to  copy  the  LQG/LTR  controller 
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The  architecture  of  the  neural  network  used  to  replace  the  LQG/LTR  controller  is  shown  in  Fig.  6.7.  This 
architecture  follows  the  structure  suggested  by  Narendra  and  Kannan  [112]  when  the  dynamics  of  the  system 
to  be  copied  is  completely  unknown. 


ud(k)  z  ‘ 

Multi  lavered  neural 

zl 

kraSS 

zl 

z'1 

e(k) 

■eBV 

v(k) 

zl 

z{ 

zl 

z{ 

Figure  6.7.  General  network  architecture  used  to  copy  the  LQG/LTR  controller 


In  this  particular  application,  since  the  neural  network  will  replace  the  LQG/LTR  control,  which  is 
known  apriori  to  be  linear.  Hence  linear  activation  functions  were  chosen  for  each  of  the  neurons  in  the  net¬ 
work.  Since  linear  activation  functions  do  not  have  a  saturation  limit,  care  must  be  taken  to  select  small  ini¬ 
tial  weights  depending  upon  the  size  of  the  network  chosen.  One  way  to  circumvent  this  problem  is  to  chose 
linear  activation  function  with  hard  limits  such  as 

TW  =1  if  (r2  1) 

r(*)=*  if  (1  >JC>— 1)  (42) 

rw=-i  if  (x<- 1) 

Following  Eq.  (3)  through  Eq.  (9),  the  output  of  the  network  in  Fig.  6.7  with  two  hidden  layers  can  be 
written  as 


y(k )  *  [vv3  x  w2  x  wl]  x  Z~' 


u(k) 

y(k) 


Nx  1 


(43) 


The  network  is  trained  using  the  standard  backpropogation  algorithm  with  the  accelerated  adaptive  learn¬ 
ing  rate  algorithm  to  reduce  the  learning  time.  Once  the  network  satisfactorily  learns  the  LQG/LTR  control¬ 
ler,  it  is  placed  in  the  control  loop  and  the  performance  of  the  closed  loop  system  is  investigated. 

A  feedforward  network  of  size  20  x  20  x  10  x  l  was  chosen  to  copy  the  controller.  The  input  to  the  net¬ 
work  consisted  of  the  past  1 1  inputs  and  the  past  9  outputs  of  the  controller.  All  the  activation  functions  were 
linear  and  the  initial  values  of  the  weights  were  set  between  ±0.2.  The  training  set  consisted  of  300  input- 
output  pairs  of  the  response  of  the  LQG/LTR  controller  to  a  uniformly  distributed  random  input  between 
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±1  volt.  The  weighting  matrices  of  the  trained  network  were  then  used  to  implement  the  controller  according 
to  Eq.  (43). 


6.6.  Examination  of  robustness  properties 

The  closed  loop  performance  of  the  cantilever  beam  with  the  modified  LQG/LTR  controller  and  the  neu¬ 
ral  network  controller  for  nominal  and  increased  tip  mass  is  compared  in  Fig.  6.8  and  6.9.  The  criterion  used 
to  compare  the  closed  loop  performance  of  the  system  was  the  time  taken  by  the  oscillations  of  the  closed 
loop  system  to  go  below  0.2  volts.  Below  0.2  volts  the  actuators  are  in  their  dead  band  and  hence  there  is  no 
control  force  applied  on  the  beam  to  suppress  the  vibrations. 


Figure  6.8.  Closed  loop  performance  of  the  controllers  for  nominal  tip  mass 


With  a  nominal  tip  mass,  the  vibrations  of  the  closed  loop  system  damped  down  below  0.2  volts  in  about 
10  seconds  as  compared  to  the  70  seconds  for  the  open  loop  system.  From  Fig.  6.8  it  can  be  seen  that  natural 
frequency  of  the  closed  loop  system  is  slightly  higher  than  that  of  the  open  loop  system.  This  is  due  to  the 
stiffness  added  to  the  system  when  the  NiTiNOL  wires  contract  and  exert  force  on  the  cantilever  beam.  The 
skew  in  the  initial  condition  of  the  open  loop  and  closed  loop  plots  seen  in  Fig.  6.8  is  due  to  the  human  error 
involved  in  deflecting  the  beam  to  about  6  inches  before  releasing  it. 

To  examine  the  robustness  of  the  controllers  against  variations  in  mechanical  configuration,  the  tip  mass 
was  increased  by  38%.  The  performance  of  the  two  controllers  for  increased  tip  mass  is  shown  in  Fig.  6.9. 
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Figure  6.9.  Closed  loop  performance  of  the  controllers  for  increased  tip  mass 

Figures  6.8  and  6.9  show  that  the  closed  loop  performance  of  the  modified  LQG/LTR  controller  and  the 
neural  network  controller  did  not  degrade  significantly  due  to  increased  tip  mass.  In  addition  to  the  tip  mass 
uncertainty,  a  robustness  study  was  also  conducted  for  an  uncertainty  in  the  sampling  rate  of  the  controller. 
The  closed  loop  response  of  the  beam  for  an  increased  sampling  time  is  shown  in  Fig.  6.10. 


Figure  6.10.  Closed  loop  performance  with  sampling  time  increased  from  0.02  sec  to  0.1  sec 

Figure  6.10  shows  that  the  LQG/LTR  controller  took  longer  to  dampen  the  vibrations  of  the  beam  to 
below  0.2  volts  than  the  neural  network  controller  when  the  sampling  time  was  increased. 
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7.  Summary 


This  report  has  been  an  overview  of  a  fairly  extensive  study  on  applying  different  control 
techniques  to  smart  structures  using  shape  memory  alloys.  The  salient  results  are  summarized 
below. 

L  Development  of  new  shape  memory  alloys 

Indium-thallium  alloys  in  the  range  15-38  at%Tl  were  electrodeposited  from  a  sulfate 
electrolyte  using  pulsed  current.  The  relationship  between  composition  and  phase  structure  of  the 
alloys  deposited  was  investigated  with  regard  to  the  shape  memory  effect.  To  date  there  have  been 
no  reported  experimental  studies  describing  the  electrodeposition  of  In-Tl  alloys  capable  of 
undergoing  such  transformations.  Not  only  did  the  deposited  alloys  exhibit  the  shape  memory 
effect,  but  the  composition-phase  relationship  was  shown  to  deviate  from  that  reported  for  alloys  in 
an  equilibrium  condition.  The  results  show  that  the  temperature  and  the  temperature  interval  for  the 
transformation  of  the  electrodeposited  alloys  are  substantially  different  from  those  of  the  thermally 
prepared  alloys.  The  shape  recovery  for  the  electrodeposited  alloys  was  also  confirmed. 

Cu-Zn,  Au-Cd,  In-Tl  and  In-Cd  alloy  films  were  produced  using  electrolytic  techniques 
and  their  shape  memory  properties  were  evaluated.  Cyanide  solutions  were  used  for  the  Cu-Zn 
and  Au-Cd  deposits,  but  sulfate  solutions  were  preferred  for  the  indium  alloys.  The  use  of  pulsed 
current  was  beneficial  in  giving  dense  and  uniform  deposits.  The  Cu-Zn  and  Au-Cd  alloys  did 
exhibit  a  degree  of  brittle  behavior  in  the  as-deposited  condition.  Simple  bending  tests  were 
conducted  and  shape  memory  effects  were  confirmed  on  the  Cu-Zn  and  indium  alloys.  The  phases 
present  were  determined  at  different  temperatures  using  X-ray  diffraction.  A  low  temperature  unit 
was  used  for  Cu-Zn  and  Au-Cd  while  an  elevated  temperature  furnace  attachment  was  necessary 
for  the  indium  alloys.  The  phases  identified  at  ambient  temperature  for  Cu-Zn  and  Au-Cd  deposits 
were  identified  as  ones  normally  found  at  elevated  temperature  for  alloys  in  an  equilibrium 
condition.  The  phase  transformation  behavior  of  the  electrodeposited  shape  memory  alloys,  in 
general,  was  different  from  those  exhibited  by  equilibrium  alleys  prepared  using  conventional 
thermal  preparation  techniques.  The  results  show  that  electrolysis  appears  to  be  a  feasible  means 
of  producing  certain  shape  memory  thin  films  with  potentially  interesting  properties.  However, 
more  extensive  studies  are  needed  to  optimize  the  performance  of  the  electrodeposited  alloy  films. 
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We  have  acquired  a  great  deal  of  knowledge  in  the  hardware  design  and  fabrication  of 
smart  structure  test  articles.  All  the  signal  processing,  interfacing,  and  data  acquisition  circuits  are 
developed  in  our  laboratory  by  graduate  students.  We  have  developed  a  state-of-the-art  laboratory 
facility  in  the  smart  structures  area. 

3.  Identification  and  robust  control  of  smart  structures 

In  this  research  effort,  we  wanted  to  use  shape  memory  alloy  actuators  to  dampen 
vibrations  of  a  flexible  structure.  First,  a  mathematical  model  was  developed  using  the  ERA 
technique.  This  technique  allowed  us  to  generate  an  accurate  model  of  the  flexible  structure  even  in 
the  presence  of  sensor  noise.  The  presence  of  substantial  amounts  of  noise  in  the  output 
measurements  required  a  proper  selection  of  an  extra  set  of  real  observer  poles  close  to  the  origin  in 
the  asymptotically  stable  observer  formulation.  This  considerably  helped  averaging  out  the  effects 
of  noise  in  calculation  of  the  Markov  parameters  of  the  system  while  minimizing  the  calculation 
time.  The  presence  of  noise  also  drive  the  need  to  use  a  larger  window  size  and  larger  number  of 
observer  poles.  Once  the  ERA  model  was  obtained,  we  implemented  a  standard  LQG/LTR 
controller  with  no  restriction  on  controller  effort.  This  controller  saturated  the  actuators  to  an 
unacceptable  degree.  Next,  we  reduced  the  amount  of  loop  transfer  recovery  until  the  controller  no 
longer  saturated  the  actuators.  However,  this  controller  was  sensitive  to  system  uncertainties. 
Finally,  we  implemented  a  modified  LQG/LTR  controller.  This  modified  structure  quickened  the 
loop  transfer  recovery  process  allowing  us  to  design  a  controller  that  was  both  insensitive  to 
system  uncertainties  and  did  not  saturate  the  actuators.  We  successfully  implemented  this  modified 
LQG/LTR  controller  on  the  flexible  structure  and  demonstrated  its  insensitivity  to  uncertainties. 
This  experiment  showed  that  the  modified  LQG/LTR  controller  was  robust  to  tip  mass  uncertainty. 

4.  Adaptive  control  using  neural  networks 

The  adaptive  control  algorithms  have  been  uti’ized  for  designing  active  controllers  for  smart 
structure  test  articles.  Adaptive  control  schemes  require  only  a  limited  a  priori  knowledge  about  the 
system  to  be  controlled.  The  availability  of  limited  control  force  and  inherent  deadband  and 
saturation  effects  of  shape  memory  alloy  actuators  are  incorporated  in  the  selection  of  a  reference 
model.  The  vibration  suppression  properties  of  smart  structures  were  successfully  demonstrated 
by  implementing  the  conventional  model  reference  adaptive  controllers  on  test  articles.  The 
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controller  parameters  were  converged  to  steady  state  values  within  8  seconds  for  both  direct  and 
indirect  model  reference  adaptive  controllers  (MRAC). 

Various  neural  network  based  adaptive  control  techniques  were  discussed  in  this  paper. 
We  have  successfully  demonstrated  the  utilization  of  neural  networks  for  system  identification 
using  off-line  experimental  data.  A  major  problem  in  implementing  neural  network  based  MRAC 
is  the  translation  of  output  error  between  the  plant  and  the  reference  model  to  train  the  neural 
controller.  A  technique  called  iterative  inversion  to  invert  the  neural  identification  model  of  the 
forward  dynamics  of  the  plant  for  calculating  neural  controller  gains  has  been  used.  Due  to  real¬ 
time  computer  hardware  limitation,  the  performance  of  neural  network  based  adaptive  control 
systems  is  verified  using  simulation  studies.  These  results  show  that  the  neural  network  based 
MRACs  can  be  designed  and  implemented  on  smart  structures. 
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